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Introduction. Contact transformations have had their applications pri- 
marily in dynamics and to a rather small extent in geometry. This is no doubt 
due to the fact that the geometrical object to which a contact transformation 
may be applied is of a more complicated nature than the objects to which we 
apply extended point transformations. The present paper, although far from 
completely solving the problem, indicates how the invariant theory of homo- 
geneous contact transformations may be formulated.* We first define a contact 
frame (§4) which is sufficient for the study of Grassmann’s forms of exterior 
multiplication. We then introduce a linear connection (§§5, 6, 7) for the affine 
invariant theory and finally we introduce a metric, or rather a part of it, that is 
necessary for the study of the metric properties of curves. Incidentally we 
generalize the Poisson brackets of both kinds and find a new and rather simple 
way of characterizing homogeneous contact transformations. 


1, Homogeneous contact transformations. A homogeneous contact trans- 
fomation is defined by the equations 


(ll) = pial, +++ a" p++ pn) = G2, p), Di = ¥ilz; p) 


where yg‘ and y; are such that the condition! 


(1.2) pidt' = pdr’, 

that is 

(1.3) Vi (22 dz* + ae apa) = Padx* 
ox" ODa 


holds for arbitrary values of the differentials dx*, dpa. From (1.1) we have 


. dg dg' i Ov; OY; 
(1.4 d= = a — => — dx* — d a 
) ek dx + _ dpa, dp aa O + in. Pp 


and consequently the above requirement leads to the conditions 


(1.5) . oz" = y: az! 
Pe he 


*We are aware of only one attempt in this direction, due to T. Hosokawa (Journal of 
Hokkaido Imperial Univ., Series 1, V. II, No. 3) but he attacks the problem under very 
“vere restrictions and makes assumptions that are not necessarily justifiable. 

‘The customary summation convention is used throughout this paper. 
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It can be shown that ¢‘ and y are homogeneous in the p’s of degrees zero and 


one respectively.” 


Differentiating each of the equations (1.5) with respect to x* and pg, we 
obtain from the resulting equations the equations 








Op: om Op OF _ Ok OB OL AM _ ag 
ax® ax* daa ax? . dps Ox dx Apg “" 
. (1.6) Payee 
Op Om ah aR _ 
OPs OPa OPa Ops 
From (1.4) and (1.6) we have 
OD: _ oF _, OD: . oF 
D = —— = dz ar 2 — J ; a 7" 
(1.7) di ag dp; i oe, ro ~ Ge dp, 


Since each contact transformation has a unique inverse, we have 


oe ae 
«= — dz + — dp; 
dx age of oD: Di, 


dpa = 


ae dz +2 m5 Ops. 


From these equations and (1.7), since the differentials dz‘, o are independent, 








oh ‘een 
ODa 


, az* sa 


_ Opa OX® AtF Apa _ a 


Op; Oz 
If we put 
ou du 


we have 
Op; _ dx OPa _ da’ ax* 
(1.8) apa of dp; ad” ap, 
From these equations and (1.6) we obtain 
OPa OPs Pp OPa 
a (Pa OMe _ Ne Ole _ 
(De: Ps) = 35" agi — ap, oat ~ 
(1.9) , 
Ox* Ox 
(x , 7) OD; az = 
These are equations for the inverse of (1.1). 
ou dv 
1.10 pe ~ = la <= SON Nh 
(1.10) (u, v ap, ax" 


then for the transformation (1.1) we have 


(1.11) (#, #) = 0, 


(pi, #’) = 53 : 


ODa dx” 


(pi, Di) = 0. 


Equations (1.8), or their equivalent (1.11), constitute a necessary and sufficient 
condition that equations (1.1) define a homogeneous contact transformation.’ 


2. Configurations and their transformations. 
Aaa = 1,-:- 


(1.4), we consider 2n functions A¢, 





Proceeding from equations 
n) of the 2’s and p’s, and 


2 Cf. C. G., p. 240. A reference of this kind is to Eisenhart’s Continuous Groups of 


Transformations. 
3 C.G., p. 250. 
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(2.3) 
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under a contact transformation (1.1) the 2n functions A‘, A; of the #’s and j’s 
defined by 
az az’ ~ Op 

A@ ae od 


F Opi 
A; Ss Ae A. . 
ox* ox* T 5p. dpa 





(2.1) Ai = 


We say that the A’s are the components of an entity in V,, a configuration of the 
first order, which is transformed into the entity of components A under the 
contact transformation. Thus the differentials dx+, dp. are transformed into 
dé, dp;, and are the components of a configuration. Also in consequence of 
(1.5) the configuration 


(2.2) At = 0 Aa = Da 
is transformed into A‘ = 0, A; = p;. Likewise for any configuration we have 
(2.3) pili = PaA*. 


In consequence of (1.6), and analogously to (1.7), equations (2.1) may be 
written in the form 


~, Op' az ~,0p: << 0% 
. ‘- — A; — = Ae, At — — A; — = — Ag. 
24) . aa apa ax? 
If we differentiate the scalar relation 
F(z, Dd) = f(z, Pp) 


with respect to +* and p,, and compare the resulting equations with (2.4), we see 
that the configuration 


(2.5) A@ = Of Ag = of 
~~ Da Ox* 

is transformed into 
o. ; _ Ff 
ee Oe 


If A", Az and M®, M, are the components of two configurations in the z’s and 
V's, we have from (2.1) and (1.6) that 


(2.6) A‘M; — A,M‘ = AM, — A.M*. 


When in particular, we take for A and M expressions of the form (2.5) in terms 
of two functions f and g, we have 


(f,9) = (F,9) 


which is the well-known result that the parenthesis of Poisson of two =o. 
8 invariant under a homogeneous contact transformation.‘ Also (2.3) is 
‘pecial case of (2.6) for A%, Aq and 0, pe. 


ja aaah 
‘C.G., p. 250. 
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If we put 


(2.7) A%® = AeMBS, 


and similarly for A and M, we have ei means of (2.1) 


Ai = OU a” 


1 


i 


(2.8) 


pm 


A; = 


Any sets of quantities satisfying these equations under a contact transformation 
we call the components of a configuration of the second order. 








Ag* = AgM?2, 


Oz' Oz? 


idee apa axe 


ax* ODi 4 


rv 5 apa ane A 


Op; az’ 4 


a+ ODa axe 





Opi OD; Acs 4 CPi Opi se 44 Opi OD; [Pi 4 8 


Opa ax® 


the definition of configurations of higher order. 
From (2.8) we have for the particular case 


(2.9) A*é = 0, 


the components A are of the same form. Thus (2.9) define a numerical con- 


figuration of the second order. 
From (2.1) and (2.8) it follows that 


— A, A, A*A ap _ A, A%g 


(2.10) 


are the components of a configuration of the first, order, as are also 


— AAs, Aas AP — 


(2.11) 


A,f = —5,8 : 


In particular for the case (2.2) we have that 


(2.12) 


at | 


ae t+ — 


A? + 


Aas = AaMsg 
0z* az! 
" Opa Opp 
az? Op; 
Opa Op» Avs 
Op; 2 
ap 
ODa om 
OP: OP; 4 


AdPAg. 


Pa A, paA%s and A* pg, Aa’ pg 


are components of two configurations of the first order. 


3. A subgroup of the general analytic group. The group of homogeneous 
contact transformations may be looked upon as a sub-group of the general 
- , 22” be these variables with the 
We denote the configuration A‘, Aa by 4, 
with the understanding that a superscript n + 7 is replaced by a subscript ?. 


analytic group in 2n variables. 
understanding that 2*** = p,. 


Let 2, - 


In this notation equations (2.1) become 


(3.1) 
and (2.8) are 
(3.2) 


4 = AB 


=A 
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Later (§3) we give 
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A,B=1,-:+,2n 
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By the use of this notation we may write down readily the equations of trans- 
formation of a configuration of any order. 
The configuration (2.9) may also be written as 


33) @«#=+1lforB=A+n, —lforB=A-—n, 0 otherwise. 


If we require that the configuration «42 be invariant under the transformation 
(3.2) we obtain (1.11). Hence: 

3.1] A necessary and sufficient condition that an analytic transformation 
#4 = £4(x) be a homogeneous contact transformation is that the antisymmetric 
configuration «48 be invariant under the transformation. 

Since | «42 | = + 1, it follows from (3.2) for this configuration that A? = 1, A 
being the Jacobian of the transformation. For an infinitesimal transformation 
ji = 74 + E4e, A = 1 + Ape’ and since the transformation is continuous we have: 
[3.2] The Jacobian of a homogeneous contact transformation is equal to +1. 

We refer to the foregoing configurations as contravariant, and to those, as 
Aus, a8 covariant, understanding that for A = n + a the index is a super- 
script. Its law of transformation is 


: ax? ax? 
(3.4) As _ Acp az ax® 


In particular we have the skew-symmetric numerical configuration e,, taking 
the values as in (3.3). If we put A’4 = e4nA%, we have A’, = Ag, A’* = —A*. 


. Thus equations (2.4) may be written, in consequence of (1.8) 


ox? 
3.5 ‘, = A’, ——. 
(3.5) A's ° oa 


Likewise we have mixed configurations, or A4z, the first index being a subscript 


iforA = n + 7, ete. 
The relations 


A 
(3.6) €4¢ enc = —€Cecn = A écn = 53 


will be useful in what follows. 

An important subgroup of the full linear group is what Wey] calls the complex 
group; it is of determinant + 1 and bears the same relation to the non-singular 
skew-symmetric bilinear form {A, B} = A*B, — AqB* that the orthogonal 
group does to the symmetric form (A, B) = A*B, + A.B*. Therefore the 
basic scalar invariant of the group of homogeneous contact transformations is 
de* ip, — dpy dx; it completely characterises the group of homogeneous contact 
transformations. 


4. The contact frame. In order to determine the significance of the com- 
ponents A*, Ag of a configuration in a V, of codrdinates +* we consider the 
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special contact transformations, which are extended point transformations, 


where ¢g’ in (1.1) do not involve the p’s; we write them thus 


axe 


(4.1) ci=giz), p= 5n% PB» 


the latter being a consequence of (1.5). In this case equations (2.1) become 


se ! _ op; ap; _ ap; 
+= __ Aa i; A@ Ag Ae 
(4.2) <A er A oa + OD = 7 +=. iA 











the latter following from (1.8). From the first of these equations it follows that 
A*, which may involve the p’s, are components of a contravariant vector, and 
from the second of (4.1) that p. are the components of a covariant vector. 

- Suppose an affine connection is assigned to the V, by means of the equations 


a? 28 _ lk ax8 B ax? dx® 


(4.3) agtogi 4 agit — 178 aah anh" 








From these equations and (4.1) we obtain 


ap; ax8 aay ax®\ aa 1, ax 
(4.4) Di == cme(Tit Se re —, oS rites - Ti aPp- 





aa ** oe 7 aa’ aa! os ** ax’ 
By means of this result the second set of equations (4.1) may be written 


Ox 


itp 
A, ~ a*F i = A, ~— PE) — 


Consequently if we put 
(4.5) Ae = A, — A*T2 5p, ? 


then d, are the components of a covariant vector in V,. Accordingly if we have 
a configuration A*, A., then A* = A* and \* given by (4.5) are the components 
of two vectors in V,, and conversely if we have two vectors we get a configura- 
tion. 

The preceding result followed the introduction of an affine connection in V, 
and the use of extended point transformations. In any case a configuration 
A4 may be assigned a set of components \4 defined by the equations 


(4.6) a= Ae Ne = Ae — Tag AP 


where I'ag are functions of the z’s and p’s homogeneous of degree one in the p’s 
and symmetric a and 8. We require that this definition be invariant not only 
under extended point transformations but under the full group of homogeneous 
contact transformations. Hence from equations (2.1) we must have 


ii = (% +0. oF) 4 de < 
(4.7) 7 


i; + Pj! —= ne ( 28 + Tyga oer) + da 7, 
Ops ODa 





and ¢ 
(4.9) 
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From these equations we have in consequence of (1.8) 


= ax . ap; Op;\ | _ 
ya (G+ SE) +04] 8 Ae ? Ts) ~ (Bar Tm 5p) | =o 


In order that under an extended point transformation \, be transformed into 
j;, we must have (changing indices) 





- Ox" = @2% 
Ni = el — fo— 
(68) (3 iii =) 
and 
— {Oz Oz Opi OD; 
rT; a a Fe Fir Ve . 
(49) i( 2 + Pap oe) = age + Tae aps 


The functions T'.g(x, p) are the components of an entity we call a contact 
frame and (4.9) is its law of transformation. We must show that the compo- 
nents of the contact frame are uniquely determined under any contact trans- 
formation and that the symmetry property is preserved. By means of (1.8), 
(4.9) may be written as 








ox — Or OPs | w. OPp 
‘ a T;; —— = . Ty; 
va ad (z TM =) age ’ OD; 
for which and (4.9) we get 
Ox = @27% ox* oz’ ; 
411 — Dyn — Sa = §°. 
sae (3 iis” a) (2 ilies =) 
Therefore 
ox = 
0 
ax +* “ 








and equations (4.9) determine T;; aon From (4. ~ and the inverse of 


(4.9) we have 
0x 
Py — to (% +7 ry) = 0 


consequently T';; = Ty. (See note at end of paper). 
Because of (4.7) and (4.8) we have 


ane (2% + Tm 28) +, 22 
(4.12) . 
i Ox = -O=* 
| @ &e i = Ny —— 
sites (3 + ) 


and we accordingly call 4 the vector components of the configuration A4. We 
may note that any set of functions \4 obeying (4.11) determine a unique con- 
figuration by means of (4.6) and that for two contravariant configurations 


(4.13) AgM* — A%Mg = Aah® — Aa 


ia scalar. 


eo 


— teeters 





ee a ee 


= 
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Similarly the vector components of a covariant configuration A, are defined 
by 


(4.14) Mo = AZ a = Aa + Tag AP 


with the law of transformation 


ime (2% + rep) — Ox 








Ox* Ops ” ODa 
(4.15) . , 
= re aia Y Phd 
i = Aa : ry; — }. 
naib ( ii. =) 
5. The operators X.f and P*f. We define two operators by the equations 
af af of 
= — af =» Pe SS a 
(5.1) Xaf(z, Pp) axe +  ¥ B aps f Da 
and similarly 
eae ae | on. oF 
i = —. T;; ——. Pf = ——e 
In this notation the fundamental relations (4.9) and (4.10) are 
(5.3) Ty; X.t/ = Xapi, Tas Xit® = Xipa 
and from (4.11) 
(5.4) Xd -Xa8= 88, Xaki- Xt = 45. 


In consequence of (1.8) we have 
a Pez + Pizx=0, Pez X,# = Pex Xz 
Pep; Xt — Pati Xap; = 62, — P*; Xap; = P%p; Xabi 


which are generalizations of (1.11). 
The fundamental equations for contravariant and covariant vectors under a 
homogeneous contact transformation, namely (4.7) and (4.9), take the form 


(5.6) N = v2 X ak! -b Ne Pez, i; = Xx ° A*. 
For the commutators of the operators we have 


(X., Xa) f = (X.Xp sae XpXa) f _— (X20 sy at X6Vay) Pf 














(5.7) 
= Tr, ,ap Pf 
(58) (Ps, X,)f = (P*Xs— XpP)f= —%, (Ps, Pe) f = 0. 
From (5.7) we have 
ars or. ors ODay 
5.9 lr, = — - —2 « —— — 
a ree ax? sii Ops ae Ops 
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we 0 
any | 
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from which it follows that 


(5.10) Ty ,0s + Ty, 6a = 0, Ve.oy + VU s.4e + Ty = 0. 
From (5.3) and (1.8) we obtain 

(5.11) Xf = Xaz- Xf 

and 

(5.12) Pif = X,%. Prf — Pati. Xaf. 


From these relations we have 
(513) PaXf = Xaf - PXjax + Xjxx (Xst! PPraf — Pet XpXaf) 
(5.14) X,;Pif = Xjx° . X3 (X.# P*f — PrzX,f). 
Substituting in 
(Ps, Rf = FH Pej, 
we obtain an equation of the form AX.f + BP*f = 0. Since this must hold for 


any f, and since X.f and P*f are independent as follows from (5.1), we must 
have A = B = 0; this gives the two equations 





(5:15) PiX gx + Xx? . X,Pazi + an Pez = 0 
(5.16) X,Xaki = a4 X,z - i Xat! Xpt* + Ts. ay PrZ'. 
Y 1 


From (5.11) we have X‘X,f = X;Xia--X.f + X ja*Xj28 — X,X.f. Sub- 
stituting in (X;, Xf = Ty, «P*f and making use of (5.7), (5.16) and the inverse 
of the latter, we get 


(Ty x X,z' — Ty, a8 Xja7X,.2°) Prf = 0. 
Since this holds for any f, we have in consequence of (5.4) 
(6.17) Tix = Ta, py Xia - Xjx° . X,a7 


and consequently I'.,37 are the components of a covariant tensor in V, skew- 
symmetric in the indices B and y. 
For a configuration of the second order of components A**, A%s, Ag*, Aag We 
have analogously to (4.6) 
(5 18) axe — Ax, a%s = a%s —_ T3yA°7, ag* = Ag* — T'3,A% 
Aas = Aas = Tey A‘; = Ts, A,’ + Poy Tg A” 


Where a**, a%s, ag, dag are components of tensors in V, which under a contact 
\tansformation are related as follows: 


~J 
or 
> 

Tt 





pea 


= 














756 L. P. EISENHART AND M. S. KNEBELMAN 


aii = (a%® X,%/ + a%g P8Z/) X,F' + (aa® X pk + dag P8Z%) Paz 
a‘; = (a%, X ok! + Gag P;') X28 
(5.19) | . es 
a; = (ag% Xak' “fe Age P«Z') X;x8 
aij = Gag X;x . X,2°. 


We call a4 the tensor components of the configuration A4*. In particular, if 
we take 


(5.20) a% = — ds* = 63 = Aap = PaDs 
we find that the corresponding a’s are of similar forms and 
(5.21) ai = a%*® Xz X pi. 

For a mixed configuration A4z the vector components are 


a8 = A%B, = a%y = A%e+TpyA%, = g* = Ag* — Tp, A, 


(5.22) 
Aap = Aag + TgyAa™ sai Tay As — Toy T'gs Av 


with the component transformation 

aii = (a X,Hi — a% PPR)X aE + (da? Xp Fi — dag P? Zi) Po z! 
_— ai; = (a% Xad + nlieienag 

G;' = (ag*Xai' — Aga P* X')X; 28 

dij = Mag Xi 2% X; 2°. 
Similarly for a covariant configuration A4s we have the tensor components 


ae = Ae, — a%s = A%s+T5yA%, = ag* = Ag® + Igy A™ 


(5.24) 
eg = Aas + | on As + Ty Aa? + Tey T'g3 A®. 
6. Covariant differentiation. If \4 are the vector components of a con- 
travariant configuration A4, the quantities X.4, P*\4 do not have tensor 
character under contact transformations. In order to obtain invariant con- 


figurations from a given one by differentiation, we define covariant differentia- 
A 


tion in terms of the components Isc of a linear connection assigned to Vn. If 
we denote the operators X_ and P* symbolically by a and ~, we define 
covariant differentiation of \4 by 


_ an 


A 
(6.1) M2 = age + d° les 





wl 


F 
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which is the symbolic expression for the four definitions 
a a 
A%3 = Xgr* + ATL + AWl%, 


Aw = PBA* + ALP +4 18, 
Aa? = PPr, + ATL,F + hy 178 , 


Nara = Xpra + AML gs + Ayls. 


We seek the conditions connecting the l’s and the l’s in a new system 2, p 
so that A**®, A%;g, Aa’®, Aa:g are four tensors determining a mixed configuration 
of the second order. In accordance with (5.23), we must have 


Ny = Kpat-X;2? rap. 

In accordance with (5.6) and (5.11) we have 
(6.3) Xpra = XpXak-di + Xa dX H X;H; 
and since (6.3) must hold for any \4, we obtain 

XgXati = li,XqHX gs" — 4g X_F + ly PF 
(6.4) eee oe 

Lic = Igy Xix* Xj; 2°Xi2x7. 
For the third of (6.2) we must have 
Ait = d'8 X;a* X pF — rarg Xi xt PP Zi. 

In accordance with (5.12) we have 
(6.5) Pi; = Pi X;27-d, + Xix2(XpH PPA. — PPR Xpra) 
and we obtain the two sets of equations 
PiX;27 = Xiae (be Xp@ — 5 PF ®) _ Bi Xe ay li Pa 


(6.6) —— 
L# = Igy X;a* Xj; 2° X, _ Igy Xi a* Xj 2° Pr Z*. 


Proceeding in like manner with the quantities \.3 and \“‘4 in equations of the 


form (5.23), we get the two sets of equations 


¥ t 
Xp Xa Ht —_ lap X y Z! = Lie Xa ti Xg t* + lag PY &' 
(6.7) Y 


Y ce 5 ? 
Xp P zi = le, X., x + l=, P x — (i, Xp + I; P*z*) Xi 
Y 


aes ~“s.2 te or Mo 
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and 
Ps Paz = [99 X, zt + [8 Pr zg — (ie, Xp" + ly P= zt) P8 j 
Y 
— (is X20 + ly’ P= 3) X;x8 


(6.8) - ‘ 
X,z'-P8X,z — Pes -X,X.H = (ex, - Lap P*2i) Xz 


4 (hf Xox _ lag P* 2) Prgi — 1) X, 2. 
Y Y 
The eight sets of equations (6.4), (6.6), (6.7) and (6,8) are just sufficient to 
I A 


determine uniquely the components lx in terms of Isc. However, we are not 
A 


free to choose the lsc arbitrarily because of (5.15), (5.16) and the commutator 
conditions (5.7) and the second of (5.8). From the first of (6.4) and (5.7) we 


get 
(6.9) (Ui a Ui) X qt Xp x wet (Ms i a) X, Zi i (be om bre + T'y.08) Pri, 
i a a 


From the first of (6.6), the second of (6.7) and (5.15) we get 
a 7 sz, e 
(i, + Hs) X,.2% = (i + ) X;2° Xz + (i — ,) X; x8 Pz 
j B Y 8 


i 
aa (Se — lj — h') Pez. 
Op; i 


(6.10) 





Since the first of (6.8) must be symmetric in a and 6 we have 


Y Y i i id 
(i — ve) Xt + (a ~ pe) Pre = (i — bi) X20 Ra? 
Y 7 


(6.11) , , 
+ (is - ls) Pa gi PBZ + (i _ is) (X, 2% P' gi — X,x° Pz’). 


From the first of (6.4) and (6.7) we obtain by subtraction 


Y mi i ' 
(6.12) (i + Ps) Xy Zz — (is oe i) Xok Xgx* + (ts atte bn) P7z = 0. 
a 7 7 a 
If the first of (6.7) and the inverse of the first of (6.6) are substituted in the 
second of (6.8) we obtain 


(ie 4 w) X, a+ (1 4 ls) Xp! P13 Pog — (is 4 r) Xd X5@ 
k k Y 


(6.13) 
+ (ws _ uf) Xp a Pez = 0. 
Y a 
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From (6.6), (6.7) and (6.8) we obtain 


] s— Te) x zi X p= (i l ~ t3) zi 
(i +I; Op: ak’ Age ap + nl OP, pe 





(6.14) 
+ (ts aoa ley + rar) Pym. 


. 

7. Symmetry conditions on the linear connection. From (5.18), (5.22) and 
(5.24) it follows that the tensor components of the three fundamental numerical 
configurations e4”, 4s and 6% have the same respective forms as the components 
of the configuration. From §5 it follows that if A, = e42A?, the tensor com- 
ponents of these associate covariant and contravariant configurations are in the 
relation 


(7.1) Na = End? 


We define the covariant derivatives of \4 by equations analogous to (6.1), 
namely 


. Or 4 
(7.2) Nays = 3 — Aclas. 


Extensions of the definitions (6.1) and (7.2) to tensor components of configura- 
tions of higher order are made as in ordinary tensor analysis. We require that 
the covariant derivative of an associate configuration be the associate of the 


covariant derivative of the original configuration. In particular, for (7.1) we 
must have 


Nass = €ac\°:B 
but from (7.2) we have 


Na;B _ eacd°;z + VA exc;2 . ' 
Consequently the above requirement is equivalent to 
(7.3) 


éaz;c = O 

Since 63.¢ = 0, it follows from (3.6) that 

(7.4) e4B.c = 0. 

lhorder that (7.3) be satisfied, it is necessary and sufficient that the components 


of the connection satisfy the six sets of relations 


a B a 


(7.5) Bo = Iec, lac => —%_ ’ lsc = lac . 
B a 8 


If and x4 are the vector components of two contravariant configurations of 
de tet coder, We Taies hatch te (4.13) that ¢ = Aau* — A%He is a scalar. 
The requirement that 

; Ga = Xa, = Py 
Sequivalent to (7.5). 
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In view of (7.5) the conditions (6.12) and (6.13) are satisfied identically, 
We impose further symmetry conditions on the linear connection to satisfy 
the other conditions in §6. Since ls, and T.,s, each obey the same linear 


homogeneous law of transformation, we take 





(7.6) Le — re = Typ. 

From (7.6) and (6.14) it follows that we may take 
7 O1'sy 

yf lsy + ly* = 

( ) By Dy a Da 


as an invariant condition. From (7.7) and (7.5) we have 


7.8 ms ms 
vane lay = lye y 


in consequence of which and (7.6) the conditions (6.9) are identically satisfied. 
In consequence of (7.7), (7.8) and (7.5) we may take 


ina B, = Lf, 
from which it follows by (7.5) that 
(7.10) [Py = [8, 


Making use of the above relations and (6.11) we may take 


a jor = I 


as a result of which and (7.5) 187 is symmetric in all indices. 


All these conditions may be combined into the following definition: 
A 


The components of linear connection Igc are chosen subject to the invariant 
conditions 


(7.12) : . 


AK leo = €FK Lec 


(7.13) 4 


lac _ les 


with the addition of (7.6), (7.7) and (7.10). 


8. Curvature; the Ricci identity. In ordinary tensor analysis covariant 
differentiation is not commutative except for scalar functions. If t’"’ are the 
components of a tensor, t'"*;,; — t!11,;,, is expressible linearly in terms of the 
components of the tensor, the coefficients being the components of the curvature 
tensor of the connection employed in the covariant differentiation. We shall 


show that this is the case with our linear connection provided that the symmetry 
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conditions of §7 are satisfied. In fact the above requirement is equivalent 
to the full set of symmetry relations. 
Let 44 be the vector components of a configuration of the first order; then 





ee 
AtB;c = + \? 5 rm saad \* D ie 
a*n4 » alos Suk 
~ aa age + ™” Gyo — Moe Inc 


A A D 
+ APc los + APB loc — Md lac. 





Therefore 
074 074 4 ° 
AtB;c — Ato; = ds as® ~ DeP Ost A*D (inc _ len) 
(8.1) A A 
al, al - 
+ 2 (sae dt « ae i a — le ns) 


We have to consider three possible cases of which we shall develop only one viz. 
B=8,C = y. Here 


D D A 
(X,Xp oz: XX) aid At D (i, ai la) = Toys rt + Dspy (py. +»? io) 


A E E 
= 0? loz (i,, — he 


because of (7.6) and (7.13). Similar results are obtainable for the cases B = 8, 
C=n+yandB=n+8,C = n+ 7; we therefore have 


A 
(8.2) Ai B;¢ —Atc;8 = ” Kose 


where 


A 
A A E A 
(8.3) Kose = cor ie + ie lec — loc len — tis (inc ~~ ice). 


A 
Equations (8.2) are the analogues of the identity of Ricci and Kosc are the 
tensor components of a configuration of the fourth order that we designate as 
A 


the curvature of our linear connection. It is easily verifiable that Kosc have 
A 


the same symmetry properties in the indices A and D that lpg have. 

Since Tag is homogeneous of degree 1 in the p’s, the contact frame is com- 
pletely determined by the linear connection by means of (7.7). It is then not 
difficult to verify that the vanishing of the curvature tensor is a necessary and 


sufficient condition that the linear connection be flat, that is, that there exist a 
A 


homogeneous contact transformation which transforms the given connection lsc 
A 


into lee = 0. 
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9. Parallelism. Paths. If by a curve in V, we understand a set of points 
given by z* = x°(t), we are not in a position to apply a contact transformation 
to the curve unless we have a way of attaching at each point an elemental 
hypersurface p.(#). In order that this be done in an invariant manner we make 
use of our contact frame and require that the vector components of the con- 
figuration = =p be = 0.5 We therefore assume that x‘ = z(t), p; = p;(t) 


define a curve, if and only if 


dp; _ 
> ie 
That this condition is invariant under contact transformations follows from the 
law of transformation of the contact frame. 

We now say that a configuration whose vector components are \4, is parallel 
with respect to a curve if 


dx! 
(9.1) Pi(2, p) —. 


(9.2) 14. = =O 


(° 
and a configuration field is parallel if 
(9.3) ra = 0. 


Equations (9.3) are also invariant under contact transformations since the 
transformation law of the components u4,_ does not involve u4%. In particular 
we say a curve is a path if the configuration od , oe is parallel with respect to 
itself. This leads to the equations 


dye * dx® dav 


ae + ex, p) = a = 0 


(9.4) ‘ a 
Ee ws ~~, 
and 
dx® dav 
9.5 po 
( ) a dt dt 0. 


If we require that there be a path through each element 2, Mo in any direction 


d. 
(2). ls, + lyg = 0. Hence we assume 


(9.6) ls = $ Ta, p7 


a 


so that (9.5) are satisfied identically. 





dz* 
* There would be no gain in generality if we required the vector components to be Tr 


¢(t)p; as by a change of parameter this is reducible to the assumption we are making. 





whe 


Carr 
com 








T 


mn 





HOMOGENEOUS CONTACT TRANSFORMATIONS 763 
10. The metric. Extremals. If \4 are the vector components of a con- 
figuration, its length is given by 
(10.1) 2 = gan(z, p) \4v2 


where gas = gza are the tensor components of a configuration of the second order 
and are assumed to be homogeneous of degree zero in the p’s. Just as in the 
case of paths we did not need the full linear connection so in the case of curve 
length we do not need the full metric tensor. According to (10.1) the length 
of acurve is given by 


(10.2) ds? = gag(x, p) dx dx°. 
Let (2%, po) and (21, p1) be joined by a curve 
(10.3) 7 = x(t), es p(t), dp; _ Vy;dz’, 
and consider the field of curves given by 
Zi = Z(t) + w(x, p)io 
(10.4) 
Di = pilt) + w;(z, p)éo 


where w‘ and w; are so chosen that the points corresponding to the same value of 


tshould constitute a curve; that is, since a = w' and “Bs = w; we must have 
Co o 


(10.5) — 
Since for a given 60, and j must define a curve, we get from (10.4) and (10.5) 
(10.6) dw; = Tyzdwi + w*X,T;;-dz’. 
From (10.5) we have 
dw; = Tyjdwi + w'X Ty dz’. 
Comparing these two results we find 
(10.7) T;,2d2/ia* = 0.8 


When we minimize the arc length from Po to P; we find, as in Riemannian 
geometry, that the Euler equations become 


0 0 da 
10.8 = aE ot ao ae 
ons oat + Pe ap; dt di 
ds ; é 7) a 2 
where g = — hao —, Si — + T;— = X, if we let the parameter 
(~. and z i Since = + Ts; Op; , p 





: The geometrical significance of this set of equations is that if the configuration (p;, 0) is 
carried by parallel displacement around the contour z, z + 62, x + 6x + dz, x + dz, the 
components are unchanged. 
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be the arc length, the equations of the extremals are found to be 


2 i : i dork 
oF + {5b ee _ 


ds? ' \ jk) ds ds 
(10.9) ? 
dpi _ rr. @@ 
ds ” ds 
where the Christoffel symbols are formed out of g.g using the operator X, for t 
0 
ax . 


We thus have, as in §9, a unique extremal determined by the initial conditions 
20, Po, %. A particular set of curves are given by the differential equations 


dx‘ - dp; . 
(10.10) se = 92, Pp; = Taghpy. 


The requirement that these curves be extremals gives 
9” Ps Ps (x, + PPP yg + 3} i) ©. 


Hence as in the case of a linear connection (Cf. (7.7)), if we let 


P*T ey — ‘eh To, 


we shall have 


(10.11) . L* pag” ps = 0.7 
B 


More generally if C(x, p) is a scalar function homogeneous of degree one in 


a =. is a covariant configuration of the first order whose contra- 


the p’s, — 
P*s oza ODa 


variant vector components are ~ and XC; these components will determine 


a 


a curve 


dx* aC dp ac 
10.12 . saget OR aeons el 
( ) ds Da’ ds ax 
if and only if 

(10.13) XC = 0. 


Equations (10.13) are a system of partial differential equations whose inte- 
grability conditions are T'a,,P*C = 0, (Xs Ta.8y — Veal egy) P*C = 0 ete. 
Assuming that these conditions can be satisfied and that C is so chosen as to 
satisfy (10.13), we consider the question whether these curves can be extremals. 





’ The equations (10, 11) mean that the vector (pa, 0) is parallel with respect to the 
extremals (10.10). 
* Cf. C. G. §67 where C is taken to be /(g‘ip; pj). 


a 
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If this is to be the case we must have 


X,P*C-P#C = -{ 5) pro.prc 
By 
and since 
XpP*C = P*XgC — P*Tgy-P1C = —P*Ts,-P7C 


the required condition reduces to 


(Pry - 1s) P®C.P1C = 0, 


Nore (added in proofs, October 15, 1936): In §4 it is shown that if equations (4.9) are 
consistent they admit a unique solution, It is therefore understood that a V, with a 
frame I'gg being given, it is subjected to transformations for which (4.9) are consistent. 


PRINCETON UNIVERSITY. 
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THE SUBGROUP OF THE ELEMENTS OF FINITE ORDER OF AN 
ABELIAN GROUP { 


By REINHOLD BAER 


(Received March 5, 1936) 


Every (additively written) Abelian group A determines two characteristic 
“unmixed” groups: the subgroup F(A) of all the elements of finite order in A 
and the classgroup J(A) = A/F(A). All the elements of F(A) have finite order 
and 0 is the only element of finite order in J(A). If conversely all the elements 
of the Abelian group F have finite order and 0 is the only element of finite order 
of the Abelian group J, then the direct sum A = F + J satisfies F(A) = F and 
I(A) = J.1. The structure of A is therefore completely determined by the 
structure of F(A) and of (A) if, and only : F(A) is a direct summand of every 
group A satisfying F(A) = F and I(A) = 

In the following, conditions will be given cee the group F(A) to be a direct 
summand of the group A. In accordance with the above considerations we are 
not interested in conditions which are dependent on the structure of the whole 
group A, but only in such conditions which depend on the structure of F(A) and 
(or) I(A) alone. 

The principal results can be found in section 8. and the concepts used for 
enunciating them in the sections 2 and 3. 


1. Subgroups satisfying nS = S for every integer n. 

(1;1) If the subgroup S of the Abelian group A satisfies nS = S for every integer 
n # 0, then S is a direct summand of A. 

Proor:? Let 7 be a greatest subgroup of A such that the intersection of S 
and T contains only the element 0. Then the subgroup of A which is generated 
by the elements of S and T is the direct sum S + T of S and T. If z is any 
element of A, then the subgroup 7'(x) of A, generated by zx and the elements in T, 
contains only elements of the form nx + t where n is an integer and ¢ an element 
of T. By the choice of T either x is contained in T or there exists an integer m 
and an element ¢’ in T such that mz — t’ is an element ~ 0 in S. 

In particular to every element z in A there exists an integer n ~ 0 such that 
nz = s + tis anelement in S + T. Since S = nS, there exists an element s’ 





t Presented to the Am. Math. Soc., February 29, 1936. 

1 This is not quite correct, since the groups I (A) and J are only isomorphic, but not 
identical. But here and in the following we identify isomorphic groups wherever that is 
possible without confusion, in particular always when there exists a ‘‘natural’’ isomorphism 
between a given group and a class group. 

2 The statement (1;1) is well known. But we prove it here, since, to the authors knowl- 
edge, it has not been published before. 
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in S such that ns’ = s and therefore the element 2’ = x —.'s’ will satisfy: 
1 =xmodS and nz’ =¢t. If X is the set of all the elements y such that 
y =zmod S and ny = 0 mod Tf for a certain positive integer n, then X is not 
yacuous and contains elements w satisfying vw = 0 mod T with a smallest 
positive v. But then there exists, as noted before, an integer u and elements a # 0 
in S,bin T such that ww = a + 6 provided w is not contained in T. If (u, v) 
is the greatest common divisor of u and v, then there exist integers h and k such 
that (u, v) = hu + ko and therefore (u, v)w = ha + (kvow + hb) = hamod T. 
It follows as before that X contains an element w’ satisfying (u, v) w’ = 0 mod T 
and therefore that v = (u, v) ie. wis a multiple of v. But that leads to a 
contradiction with a ¥ 0 and therefore w is contained in T’, i.e. x is congruent 
mod S to an element of T, i.e. A = S+ T. 


2. Groups without elements of infinite order.’ If F is an Abelian group 
without elements of infinite order, then to every prime number p there belongs 
the primary-component C(p, F) of F which consists of all those elements of F 
whose order is a power of p. F is the direct sum of its primary-components 
((p, F) and therefore every element x of F is the sum of its uniquely determined 
primary-components c(p, x) contained in C(p, F) respectively. 

Let now B be a primary Abelian group, belonging to the prime number p. 
Then p*B is the intersection of all the groups p‘B for positive integers 7. 

(2;1) If the orders of the elements of B/p*B are bounded, then p*B = p(p*B). 
p°B is therefore (by (1;1)) a direct summand of B and B a direct sum of p*B and of 
cyclic groups (of bounded orders). 

(2;2) If the orders of the elements of B/p*B are not bounded, then there exist 
elements a(i) in B such that the orders of the elements b(1) = p Dies pia(j) tend to 
infinity with i and such that b(t) is an element of lowest order in its class mod p‘B. 

Since the primary group B belonging to the prime number p satisfies gB = B 
for every prime number gq ~ p, the statement (1;1) implies: 

(2;3) Let Q(A) be the direct sum of the groups C(p, F(A)) which satisfy 
pC(p, F(A)) = C(p, F(A)). Then Q(A) is a direct summand of A. 

It is now easy to give an example of a group A such that F(A) is not a direct 
summand of A. For let B be a primary group such that the orders of its 
elements are not bounded and such that p*B = 0. The sets K(z) for7 = 1, 
2,--- form an L-series of B, if K(i) is a class of B mod p‘B and K(t) < K(z — 1) 
forevery 7. The intersection of all the classes of an L-series is either vacuous 
or contains exactly one element, since p*B = 0. Therefore it is possible to 
identify the L-series p'B + x with the element xz of B. If K(z) and H(z) are two 
L-series, then K(i) — H(i) is also an L-series. The set B of all the L-series in B 
is therefore an Abelian group, containing B, the p-adic closure of B. 





*This section is a compilation of mostly well-known facts. 
‘A first example of such a group has been given by F. Levi: Abel-sche Gruppen mit 
abzdhlbaren Elementen, Hab.-Schrift, Leipzig, 1919. 
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Since the orders of the elements of B are not bounded, it follows from (2:2) 
that there exist elements of infinite order in B. Furthermore the only direct 
summand of B which contains B is B. Since F(B) contains B, it follows there- 
fore that F(B) is not a direct summand of B. 


3. Groups without elements of finite order. Suppose that 0 is the only 
element of finite order in the (Abelian) group’ J. If p is a prime number, then J 
is said'to be p-complete, if J = pJ # 0. If J is p-complete for every prime 
number p, then J is complete. 

By (1;1) every complete subgroup of J is a direct summand of J. Since the 
joingroup of two complete subgroups is also a complete subgroup it follows that 
there exists a uniquely determined greatest complete subgroup, provided there 
exist complete subgroups at all. 

(3;1) Every group J without elements of finite order is contained in one and 
essentially only one smallest complete group. 

Proor: Let P be the set of all the pairs (z, n) for z in J and n a positive 
integer. Define equality in P by x = (2, 1), (x, n) = (y, m) if, and only if, 
mx = ny, and addition by (x, n) + (y, m) = (mx + ny, mn). 

This group P is a smallest complete group, containing J, and if P’ is another 
smallest complete group, containing J, then there exists an isomorphism between 
P and P’ which leaves all the elements in J invariant. 

If G is a greatest linearly independent subset of the complete group J, (x) the 
subgroup of all the elements of J which are dependent on z, then J is the direct 
sum of all the groups (g) for g in G and every group (g) is isomorphic with the 
additive group of all the rational numbers. As a consequence of these facts and 
of (3;1) it follows: 

(3;2) If J ts a group without elements of finite order, then any two greatest linearly 
independent subsets of J contain the same number of elements. This number is 
the rank of J. 

The subgroup S of the group J is closed if J/S does not contain elements of 
finite order, i.e. if all the elements of J which are dependent on S are contained 
in S. 

DEFINITION 3;2: Let J be an Abelian group without elements of finite order. 
If J is countable, then D(J) = 1. If v is any (finite or infinite) positive ordinal, 
then D(J) = v, if DJ) < v and if there exists a closed subgroup S of finite rank 
such that J/S is a direct sum of groups J’ with D(J') < v. 

Note that there exist groups J such that D(J/) is not defined e.g. the group of 
all the sequences of integers. 

(3;3) Suppose that the group J without elements of finite order contains a closed 
subgroup S of finite rank such that D(J/S) exists. Then D(J) exists and satisfies 
D(iJ) = D(J/S). 

Proor: If J* = J/S, D(J*) = v, then there exists a closed subgroup T* of J ° 





5 Then we say that J is a group without elements of finite order. 
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of finite rank such that J** = J*/T* is the direct sum of groups J**(u) satisfy- 
ing D(J**(u)) < v. Let J*(u) be the subgroup of J* which corresponds to 
J**(u), T and J(u) the subgroups of J corresponding to J*(u) and T* respec- 
tively. Then T is a closed subgroup of finite rank of J and, since J(u)/T and 
J**(u) are isomorphic, it follows that D(J(u)/T) < v. Hence J/T is the direct 
sum of the groups J(u)/T of smaller D and therefore D(J) S v. 

The following concepts will be needed later on: 

Let P be an (infinite) set of prime numbers. Then the element x is P-infinite 
(in J), if z ¥ 0 and if for an infinity of prime numbers pin P there exist solutions 
of the equation py = zin J. If the group J contains a P-infinite element, then 
J itself is called P-infinite and J is called almost P-infinite if there exists a closed 
subgroup S of finite rank such that J/S is P-infinite. 

Similarly: if p is a prime number, then J is called almost p-infinite if there 
exists a closed subgroup S of finite rank such that J/S contains a p-complete 
subgroup. 

If e.g. J is the additive group of all the integer p-adic numbers, then D(J) = 2 
and J is almost p-infinite but does not contain any p-complete subgroup. 


4. Extension of groups. 
(4;1) If A ts any Abelian group, x* ¥ 0 an element of A/F(A) and p a prime 
number, then there exists a group A’ such that 


(a) ASA’, 

(b) F(A) = P(A’), 

(c) the equation py* = x* has a solution y* in A'/F(A), 
(d) ether A = A’ or A’'/A is a group of order p. 


Proor: If the equation py* = x* has a solution y* in A/F(A), then A’ = A 
satisfies the conditions (a) — (d). If on the other hand the equation py* = 2* 
has no solution y* in A/F(A), let x be any element in A such that z* = F(A) + x 
and let A’ denote the group which contains A and an element y such that py = z. 
Then every element of A’ Ras the form a + cyfor ain A and 0 Sc < p where 
a, ¢ are uniquely determined by the element a + cy. Suppose now that n 
is a positive integer such that n(a + cy) = 0. If c = 0, then ais an element 
of F(A). If c ¥ 0, then c and p: are relatively prime and, since —na = ney, it 
follows that ne and therefore n is a multiple of p, ie. n = pm and hence 0 = 
m(pa + cx). Therefore pa + cz is (as an element of A) contained in F(A) or 
p(—a) = cx mod F(A). But since p is prime to c, this implies that py* = 2* 
has a solution y* in A /F(A) and this is impossible. Therefore F(A) = F(A’), 
ie. A’ satisfies the conditions (a) — (d). 

4;2) ny A is any Abelian group, then there exists an Abelian group A’ such that 


A'/F(A') is complete, [or 0], 


( 
(a) A 
: nas = ‘sey, 
(d) all the elements of A’/A are of finite order. 
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Proor: By well-ordering the elements of A/F(A) and by applying (4:1) 
successively on all the elements of A/F(A) it follows that there exists a group 
A* such that 


(a) A S A’*, 

(b) F(A) = F(A*), 

(c) the equation py* = 2x* has a solution y* in A*/F(A) for every element 
xz* ~ 0 in A/F(A) and every prime number p, 

(d) A*/A contains only elements of finite order. 


To A* there exists an analogous group (A*)* = A** and so on and the join A’ 
of all these groups satisfies the conditions of (4;2). 

(4;3) Suppose that A is an Abelian group, J a group without elements of finite 
order and A/F(A) a subgroup of J. Then there exists a group A' such that 


(a) ASA’, 

(b) F(A) = F(A’), 

(c) A’/F(A) = J (ie. there exists an isomorphism between A’/F(A) and J 
which leaves invariant the elements of A/F(A)). 


Proor: By (4;2) there exists a group A” such that A < A”, F(A) = F(A”), 
A’’/F(A) is complete and is the (by (3;1) essentially uniquely determined) 
smallest complete group which contains A/F(A). Furthermore there exists by 
(3;1) an essentially uniquely determined smallest complete group C which con- 
tains J. Since A/F(A) is a subgroup of J, there exists exactly one smallest 
complete subgroup S of C which contains A/F(A) which is therefore essentially 
identical with A’’/F(A). By (1;1) Sis a direct summand of C,i.e.C = 8S + T. 
Now it is easy to see that there exists a group A’ between A” and A” + T 
which satisfies the conditions (a) — (c). 

(4;4) If F isa group without elements of infinite order, J a group without elements 
of finite order and F(A) is a direct summand of every group A such that F(A) = F 
and A/F(A) = J, then F(B) is a direct summand of every group B such that 
F(B) = F and B/F(B) s J. 

Proor: By (4;3) B is contained in a group A such that F(A) = F(B) = F 
and A/F(A) = J. F(A) is therefore from the assumption a direct summand of 
A, ie. A = H + F(A). If K denotes the intersection of H and B, then 
B =K + F(B), since F(A) = F(B). 

(4;5) If F is a group without elements of infinite order, J a group without ele- 
ments of finite order, and if F(A) is a direct summand of every group A such that 
F(A) = C(p, F) and A/F(A) = J, then C(p, F(B)) is a direct summand of every 
group B such that F(B) = F and B/F(B) = J. 

Proor: If Q is the direct sum of all the C(q, F) with gq ¥ p, A = B/Q, then 
F(A) is essentially = C(p, F), since every class of F(B)/Q contains exactly one 
element of C(p, F). Furthermore A/F(A) = J. F(A) is therefore by the 
assumption a direct summand of A, ie. A = H + F(A). Let K be the sub- 
group of 8 corresponding to H (such that Q < K and K/Q = H). The inter- 
section of K and C(p, F) equals the intersection of Q and C(p, F) and consists 
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therefore of the 0-element alone. B is generated by the elements in C(p, F) 
and in K, since they represent all the classes of B/Q. Hence B = C(p, F) + K 
= C(p, F(B)) + K. 


5. The first necessary condition. 

(5;1) Suppose that the group J without elements of finite order contains an almost 
p-complete subgroup. Then J contains two subgroups S, T and elements e(i) such 
that 


(a) Sis a closed subgroup of finite rank of T, 

(b) S does not contain any almost p-infinite subgroup, 

(c) T is generated by the elements e(1) and the elements in S, 
(d) e(0) # O mod S, pe(t) = e(2 — 1) mod S. 


Proor: If V is an almost p-complete subgroup of J, then there exists a closed 
subgroup W of V which is of finite rank such that V/W is p-complete. Therefore 
V contains an element x which is not contained in W and the smallest closed 
subgroup of V which contains x and W is of finite rank and almost p-infinite 
The existence of an almost p-infinite subgroup of J implies therefore the 
existence of an almost p-infinite subgroup of finite rank. Therefore there 
exists an almost p-infinite subgroup U of smallest (finite) rank. Since U is 
almost p-infinite, it contains a subgroup S (closed in U) such that U/S is 
p-eomplete. The rank of S is smaller (and therefore finite) than the rank of U, 
since U/S + 0, and therefore S cannot contain any almost p-infinite sub- 
group. If e(0) is an element of U which is not contained in S (and such an 
element exists) then there exist elements e(z) such that pe(7) = e(¢ — 1) mod S 
for? = 1, 2,---, since U/S is p-complete. The subgroup T of J which is 
generated by S and the elements e(7) clearly satisfies (together with S and the 
elements e(7)) the conditions (a) — (d). 

(5;2) Suppose that the group J is of finite rank and not almost p-infinite. Then 
to every greatest linearly independent subset G of J there exists an integer w such 
that an equation pe+k¥a = > > - c(g)g with integer coefficients c(g), 0 < k, (xin J) 

in G 
implies that c(g) = 0 mod p for every g inG. 

Proor: G is finite, since J is of finite rank. Furthermore it is sufficient to 
prove the statement for k = 1. If this special case of (5;2) would not be true, 
then there would exist to every positive integer 7 an element 2x(z) in J and 
integers w(g, 7) not all of them = 0 mod p such that 

pix(i) = 2) wg, ig. 
gin@G 
Since G is finite, there exists an element h in G such that w(h, 7) # 0 mod p for 
an infinity of numbers 7. ‘Since for these values of 7 the prime number p and 
w(h, 7) are relatively prime, it follows easily that there exist elements y(7) in J 
and integers v(g, 7) such that 
py) =h+ Do v(, dg. 
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If H is the smallest closed subgroup of J which contains the elements g ¥ h in 
G, then H < J and p‘y(i) = hmod H, ie. J is almost p-infinite and this is 
impossible. 

THEOREM 5;3: Suppose that g is a prime number, that F is a group without 
elements of infinite order, that J* is a group without elements of finite order and that 
(1) the orders of the elements of C(g, F)/g*C(g, F) are not bounded, 

(2) J* ts almost g-infinite. 

Then there exists a group A such that F(A) = F, A/F(A) = J* and such that 
C(g, F(A)) ts not a direct summand of A. 

Since C(g, F(A)) is a direct summand of F(A), this implies that F(A) is not a 
direct summand of A. 

Proor: Because of (2) and (5;1) there exists a subgroup T* of J*, a subgroup 
S* of 7* and elements e*(z) in T* such that 


S* is closed in 7* and of finite rank, 

S* is not almost g-infinite, 

e*(0) is not contained in S*, 

s*(7) = ge*(z) — e*(¢ — 1) is contained in S* (for 0 < 2), 
S* and the elements e*(z) generate 7*. 


Let S be a group isomorphic with S* and let always s and s* correspond under 
this isomorphism. 

By (1) and (2;1) there exist in C(g, F) elements a(z) such that the elements 
b(t) = >> i=) gia(j + 1) are elements of lowest order in their class mod g‘C(g, F) 
but such that their orders tend to infinity with 7. 

Now let 7 be a group containing the direct sum F + S and elements e(7) such 
that 

ge(i) — et — 1) = s(t) + a(?) 
for every positive 7. 

This (Abelian) group T satisfies: F(T) = F and T/F(T) = T*. 

Suppose now that C(g, F(T)) = C(g, F) is a direct summand of T. Then 
there exists a subgroup H of 7 such that T = H + C(g, F). Let Q be the 
direct sum of all the primary-components C(p, F) with p # g. Then there 
exists to every element z in Q + S an element f(z) in C(g, F) and to every ele- 
ment e(z) an element f(z) in C(g, F) such that x + f(x) and e(z) + f(2) are con- 
tained in H and these elements f in C(g, F) are uniquely determined by this 
condition. Since H and Q + S are subgroups of 7, it follows therefore that 


f(x + y) = f(@) + fy) for z and yin Q + S. 


If x is an element in Q, the orders of x and of f(x) are relatively prime and the 
equality for the function f(z) implies therefore that f(x) = 0, if x is contained 
in Q. 

(*) The orders of the elements f(x) for x in Q + Sare bounded. | 

In order to prove this statement consider a greatest linearly independent sub- 
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set Gof S. There exists an integer w by (5;2) such that get*z = >>, i, 9 c(hyh 
for z in S and integers c(h),0 < k, implies that all the c(h) are divisible by g*. 
If now z is any element in Q + S, then z = y + s for y in Q and s in S and 
therefore as noted before: f(x) = f(s). If s ¥ 0, then there exists an integer v 
and integers »(h) such that vs = >>, ing v(h)h and v and the numbers v(h) have 
nocommon divisor. But this implies that v is not divisible by gt! and since G 
is finite and vf(s) = Doaine o(h)f(h), we have proved (*). 
Since furthermore 
g(e(z) + f(z)) = e(@ — 1) + s(z) + az) + gf) 


ei — 1) + fl — 1) + s(%) + f(s), 


we have: 
(**) gfe) — fi — 1) = f(s) — a). 


By (*) there exists a number v such that g*f(z) = 0 for xin Q + S. Then it 
follows from (**) that 
vti-1 


g’**flv + t) — f0) = dX g'(f(s(j + 1)) — a(j + 1) 


v—l 


=F efj+)- "LY va +0 


7=0 
since the elements s(j) are contained in Q + S. But this equation implies 
$0) + LeF=s g*f(s(j + 1)) = b@ + ¢ — 1) mod g™** CG, F) 


and this congruence contradicts the choice of the elements a(z), since its left 
side does not depend on 7. Hence C(g, F(T)) is not a direct summand of T. 

By (4;3) T is contained in a group A such that F(A) = F(T) = F and A/F(A) 
=J. Then C(g, F(A)) is not a direct summand of A, since otherwise C(g, F(A)) 
= C(g, F(T)) would be a direct summand of T and this is impossible as proved 
before. This completes the proof of our Theorem. 


6. The second necessary condition. 

(6;1) Suppose that P is an infinite set of prime numbers and that the group J 
without elements of finite order is almost P-infinite.. Then there exists an infinite 
subset P* of P, a pair of subgroups S and T of J and to every prime number p in P* 
an element e(p) such that 


(a) Sis a closed subgroup of finite rank of T, 

(b) S is not almost P-infinite, 

(c) T is generated by the elements e(p) and by the elements in S, 

(d) pe(p) = ge(q) 4 O mod S for every pair p, q of prime numbers in P*. 


Proor: Since the group J is almost P-infinite, there exists a closed subgroup 
U of finite rank of J , an infinite subset P’ of P and elements v(p) for every p in r’ 
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such that po(p) = qo(q) ¥ 0 mod V for every pair p, q of prime numbers in P’. 
The group V, generated by U and.the elements v(p), is an almost P-infinite sub- 
group of finite rank of J and therefore J contains an almost P-infinite sub- 
group W of lowest rank. W contains a closed subgroup S such that W/S is 
P-infinite (and of rank 1). Since the rank of S is lower than the rank of W, § 
is not almost P-infinite. Since W is almost P-infinite, there exists an infinite 
subset P* of P and elements e(p) for p in P* such that e(p) is contained in W 
and satifies: pe(p) = ge(q) # 0 mod S for every pair p, q of prime numbers in 
P*, §, P*, the elements e(p) and the subgroup 7, generated by S and the 
elements e(p), satisfy (a) — (d). 

(6;2) Suppose that the group J without elements of finite order is not almost 
P-infinite, that J is of finite rank and that G ts a greatest linearly independent subset 
of J. Then G is independent mod pJ for almost every prime number p in P. 

Proor: Let W be the set of all the prime numbers p in P such that G is not 
independent mod pJ. Then to every prime number p in W there exists an 
element w(p) in J and integers w(p, g) such that 

pw(p) = yy w(p, 9)g,0 = w(p, g) < p, not every w(p, g) = 0. 
gin 
If W is infinite, then there exists, since G is finite, an element h in G and an 
infinite subset H of W such that w(p, h) + 0 for every p in H. Since w(p, h) 
and p are relatively prime, we can assume without loss of generality that 
w(p, h) = 1 for every pin H. If V is the smallest closed subgroup of J gen- 
erated by the elements # h in G, then 


pw(p) = h ¥ 0Omod V, 


i.e. J would be almost H-infinite and therefore almost P-infinite and this is 
impossible. Hence W is finite and this proves (6;2). 

THEOREM 6;3: Suppose that F is a group without elements of finite order that 
the set P of all the prime numbers p with pC(p, F) # C(p, F) is infinite and that the 
group J* without elements of finite order is almost P-infinite. Then there exists a 
group A such that F(A) = F, A/F(A) = J* and such that F(A) is not a direct 
summand of A. 

Proor: By (6;1) there exists an infinite subset W of P and a pair 
S*, T* of subgroups of J* and elements e*(p) for p in W such that 


S* is a closed subgroup of finite rank of T*, 

S* is not almost P-infinite, 

T* is generated by the elements e*(p) and by the elements of S*, 

pe*(p) = ge*(q) ¥ 0 mod S* for every pair p, ¢ of prime numbers in W. 


Then there exists an element e* in T* such that 
pe*(p) — e* = s*(p) is an element of S*. 


Furthermore there exists to every prime number p= in W an element f(p) in C(p, F) 
which is not contained in pC(p, F). 
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Let S be a group isomorphic with S* and let always s and s* correspond under 
this isomorphism. 

T is the group containing the direct sum F + S and elements e(p), e for p in 
W such that 


pe(p) — e = s(p) + f(p). 


This (Abelian) group T' satisfies: F(T) = F and T/F(T) = T*. 

Suppose now that F(T’) is a direct summand of JT. Then T = F(T) + H 
for a suitable subgroup H of T' and therefore there exists to every element x in 
San element g(x) in F and to every p in W an element g(p) in F and an element 
gin F such that x + g(x) and e(p) + g(p), e + g are contained in H. Since S 
and H are subgroups of T' it follows that 


g(x + y) = g(x) + g(y) for z and yin S. 


(*) For almost every prime number p in W the relation c(p, g(x)) = 0 holds for every 
element x in S. 

In order to prove this statement consider a greatest linearly independent 
subset G of S. Since G is finite, there exists only a finite number of prime 
numbers p such that 


c(p, g(h)) ¥ 0 for at least one element h in G. 


Therefore it follows from (6;2) that the set V of all the prime numbers p in W 
satisfying 
c(p, g(h)) = 0 for every element h of G 


and 
G is linearly independent mod pS 


contains almost every prime number of W. 
Let now s be any element in S and p a prime number contained in V. Then 
there exist integers n, d(h) for h in G such that 
ns = >, d(h)h 
hingG 
and since G is independent mod pS, it can be assumed without loss of generality 
that n and p are relatively prime. But this relation implies 


ne(p, g(s)) = e(p, ng(s)) = e(p, g(ns)) = 2» d(h)c(p, g(h)) = 0 


and therefore 
c(p, g(s)) = Ofor every pin V. This proves (*), since V contains almost every 
element of W. 
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An obvious consequence of (*) is 
(**) For almost every prime number p in W the relations 


c(p, g(x)) = 0 fortevery x in S and c(p, g) = 0 hold. 
Since T = F(T) + A = F + H, it follows that 


p(e(p) + g(p)) — (e+ 9) = s(p) + g(s(p)) 
= s(p) + f(p) + pg(p) — 9 


or 


S(p) + pg(p) = g(s(p)) + 9 


or 
c(p, f(p)) + pe(p, g(p)) = f(p) + pe(p, g(p)) = e(p, g(s(p))) + e(p, 9). 
By (**) this implies for almost every p in W: 


S(p) + pe(p, g(p)) = 0 


and this is impossible, since f(p) is contained in C(p, F) but not in pC(p, F). 
Hence F = F(T) is not a direct summand of T. 
The Theorem 6;3 is now an obvious consequence from the statement (4;4). 


7. Sufficient conditions. 
(7;1) Suppose that the subgroup B of the group A satisfies the following conditions: 
(a) A/B is a group of rank 1 without elements of finite order 6 
(b) If the orders of the elements of C(p, F(B))/p*C(p, F(B)) are not bounded, then 
A/B is not p-complete; 
(c) If the set P of all the prime numbers p such that C(p, F(B)) # pC(p, F(B)) ts 
infinite, then A/B is not P-infinite. 
Then to every subgroup B’ of B such that B = F(B) + B’ there exists’ a subgroup 
A’ of A such that 


B' < A’'andA = F(A) + A’. 


Proor: There exists a greatest subgroup C of F(B) such that pC = C for 
every prime number p. Furthermore there exists a greatest subgroup H of A 
such that 0 is the only element contained in both the subgroups C and H, and it 
is possible to choose H in such a way that H contains B’. Then as proved in 
(1;1) it follows that A = C + H. If K is the intersection of B and H, then 
B= C + K, and the pair H, K satisfies the conditions (a) — (c) exactly as the 
pair A, B and therefore we can assume without loss of generality: 





¢ This condition implies that F(A) = F(B) and that B/F(B) is a closed subgroup of 
A/F(A). 
7 This statement is of course vacuous if F(B) is not a direct summand of the group B. 
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(d) If C is any subgroup of F(B) such that pC = C for every prime number p, 
then C = 0. 

Let now 2* # 0 be any element of the group R* = A/B. Condition (c) 
implies that there exists only a finite number of prime numbers p in P such that 
z* is the p-fold of an element in R* and therefore there exists an element e* ~ 0 
in R* such that: 
for every prime number p in P either R* is p-complete or e* is not a p-fold. 

There are three classes of prime numbers to distinguish: 

The finite set U of the prime numbers u in P such that R* is u-complete. 
The set V of prime numbers »v which are not contained in P such that R* is 
v-complete. 

The set W of prime numbers w which are not contained in P such that R* is not 
w-complete. 

Note that some of these classes may be vacuous and that the prime numbers of 
P which are not contained in U are not attributed to any one of these classes. 

If r is a prime number either in U or in V, then there exist (uniquely deter- 
mined) elements e*(r, 7) in R* such that 


e* = e*(r, 0), re*(r,4 + 1) = e*(r, 2). 


If w is a prime number in W, then there exists a (uniquely determined) ele- 
ment e*(w) and a number A(w) such that 


e*(w) is not a w-fold of an element in R* and satisfies w*™ e*(w) = e*. 


The elements e*, e*(w) and e*(r, 7) generate the group R*. 
Now let e, e(w) and e(r, 7) be elements contained in the classes e*, e*(w) and 
e*(r, 7) of A/B respectively, and since e* = e*(r, 0) we choose e = e(r,0). Then 


wr e(w) — e = f(w) + b(w), 
re(r, a + 1) = e(r, 1) sa f(r, 1) + b(r, 1), 


where f(w), f(r, ¢) are uniquely determined elements in F(B), b(w), b(r, ¢) uniquely 
determined elements in B’, (since B = F(B) + B’). 

There exists a subgroup A’ of A which contains B’ and satisfies A = F(A) + A’ 
if, and only if, there exist solutions g, g(w), g(r, 7) of the equations 


. wi) (e(w) + g(w)) — (e+ g) = btw), 
r(e(r, t+ 1) + g(r, ¢ + 1)) — (e(r, 4) + g(r, 1) = 0, 2), g(r, 0) = 9, 
in F(B). Equivalent with these equations are the equations: 
wr g(w) —g = — flu), 
rg(r,t + 1) — g(r, 4) = — f(r, 4), 97, 0) = 9g. 


If the prime number q is not contained in P, ice. if g is contained in V or W, 
then C(q, F(B)) = qC(q, F(B)) and therefore by condition (d): C(q, F(B)) = 0, 
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i.e. the orders of all the elements in F(B) are relatively prime to qg. Therefore 
there exist to every given element g uniquely determined solutions g(w) and 
g(v, 7) in F(B). 

Similarly it follows from the finiteness of U that it is sufficient to solve 
the equations uh(u, 7 + 1) — A(u, 7) = — c(u,f(u, t)), h(u, 0) = hin C(u, F(B)). 

From (b), (d) and (2;1) it follows that the orders of the elements in C(u, F(B)) 
are bounded, i.e. that there exists a number m = m(u) such that wr = 0 for 
every element z in C(u, F(B)). But then the elements 


h(u,t) = Dujao wie(u, f(u, t + 9) 
are contained in C(u, F(B)) and satisfy: 


and this completes the proof of (731). 

Remark: By an argument similar to that used in proving the Theorems 5;3 
and 6;3 it can be shown that the conditions (b) and (c) are also necessary for the 
existence of the direct summands A’ containing B’ for every possible B’. 


(7;2) Suppose that the group A satisfies the following conditions: 
(a) A/F(A) ts countable; 
(b) If the orders of the elements of C(p, F(A))/p*C(p, F(A)) are not bounded, then 
A/F(A) is not almost p-infinite; 
(c) If the set P of all the prime numbers p such that C(p, F(A)) ¥ pC(p, F(A)) 
is infinite, then A/F(A) ts not almost P-infinite. 
Then there exists to every subgroup B of A which exactly represents* a closed sub- 
group of finite rank of A/F(A) a subgroup A’ of A such that 


B< A’andA = F(A) +A’. 


Proor: Let B* be the subgroup of A* = A/F(A) which is represented by the 
elements in B. Then there exists because of condition (a) a sequence B*(7) of 
subgroups of A* such that 


B* = B*(0), 
B*(t) is a closed subgroup of finite rank of A*, 
B*(i) < Beit + 1), 
B*(¢ + 1)/B*(d) is a group of rank 1, 
either A* = B*(m) for a certain integer m or A* is the join of all the B*(:). 


Assume that there exist groups A (i) for 0 < i < ksuch that B = A(0), A@ < 
A(i + 1), A(i) represents exactly B*(i). Let B(i) be the subgroup of A which 
contains F(A) and satisfies B(i)/F(A) = B*(i). Then B(k) = F(A) + A(*) and 
it follows from (b), (c) and (7;1) applied on B(k), B(k + 1) and A(&) that there 





8 T.e. B contains at most one element of a class of A/F(A). 
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exists a subgroup A(k + 1) of B(k + 1) such that A(k) < A(k + 1) and 
Bik + 1) = F(A) + A(K + 1). Hence these subgroups A(i) exist for every 
i. If A’ is the join of all the A(z), then 


Bs A'andA = F(A) + A’. 


(7;3) Suppose that the group A satisfies the following conditions: 

(a) If the orders of the elements of C(p, F(A))/p*C(p, F(A)) are not bounded, then 
A/F(A) is not almost p-infinite. 

(b) If the set P of all the prime numbers p such that pC(p, F(A)) # C(p, F(A)) is 
infinite, then A/F(A) is not almost P-infinite. 

Suppose that the subgroup B of A satisfies the following conditions: 

(c) The intersection of B and F(A) is 0. 

(d) The subgroup B* of A* = A/F(A) which is represented by elements of B is 
closed in A* and of finite rank. 

(e) D(A*/B*) exists.® 

Then there exists a subgroup A' of A such that B S A’ and A = F(A) + A’. 

Proor: If D(A*/B*) = 1, then A* is countable, since B* is a subgroup of 
finite rank. This case of (7;3) is therefore a consequence of (7;2). Hence we 
can prove (7;3) by complete (transfinite) induction with regard to v = D(A*/B*). 

Since D(A*/B*) exists, there exists a closed subgroup S** of A** = A*/B* of 
finite rank such that A*** = A**/S** is the direct sum of groups A***(w) 
satisfying D(A***(w)) < D(A**) = v. Then let A**(w) be the subgroup of 
A* which contains S** and satisfies A**(w)/S** = A***(w), S* the subgroup 
of A* corresponding to S**, A*(w) the subgroup of A* corresponding to A**(w), 
finally A(w) and S the subgroups of A which contain F(A) and correspond to 
A*(w) and S* respectively. 

Since S* is of finite rank (and therefore countable), there exists from (a), 
(b) and (7;2) a subgroup S’ of S such that B < S’ (for B is a subgroup of S) and 
S= F(A) +S’. 

Since A*(w)/S* and A***(w) are isomorphic, D(A*(w)/S* = D(A***(w)) < v 
and there exists therefore by the hypothesis of the induction a subgroup A’(w) 
of A(w) such that S’ < A’(w) and A(w) = F(A) + A’(w). 

Let A’ be the subgroup of A, generated by the subgroups A’(w). Then every 
class of A/F(A) is represented by elements in A’, since A/S is the direct sum of 
the groups A(w)/S. Furthermore every element of A’ has the form > jm 2(6) 
where the elements (i) belong to different groups A’(w). If this element is 
contained in F(A), then >>'_, 2(i) = 0 mod S, ie. x(i) = 0 mod S, since A/S 
is the direct sum of the groups A(w)/S. Since x(7) is an element of S and of an 
A'(w), x(z) is an element of S’ and therefore >>‘_, x(z) is an element of S’. But 
this implies that > a(t) = 0, since the intersection of F(A) and S’ is 0, i.e. 
the intersection of A’ and F(A) is 0 and therefore A = A’ + F(A). Further- 
more B < §’ < A’ and this completes the proof. 





* As a consequence of (3;3) it follows from this condition (e) that D(A*) exists and 
satisfies D(A*) < D(A*/B*). 
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8. Final conclusions. 

THEOREM 8;1: Suppose that F is a group without elements of infinite order, that 
J is a group without elements of finite order and that D(J) exists. Then F(A) isa 
direct summand of every group A such that F(A) = F and A/F(A) = J if, and 
only if, the following two conditions are satisfied: 

(a) If the orders of the elements of C(p, F)/p*C(p, F) are not bounded, then J is not 
almost p-infinite. 

(b) If the set P of all the prime numbers p such that C(p, F) ¥ pC(p, F) is infinite, 
then J is not almost P-infinite. 

Proor: The necessity of condition (a) is a consequence of Theorem 5;3 and 
the necessity of condition (b) is a consequence of Theorem 6;3. 

That the conditions are sufficient is a consequence of (7;3), if we choose as the 
subgroup B of A the 0-group which satisfies all the conditions. 

Since the existence of D(J) has not been used in the proof of the necessity of 
the conditions, we have even proved the 

Coro.uary 8;2: If F(A) is a direct summand of every group A such that 
F(A) = F and A/F(A) = J, then the groups F and J satisfy the conditions (a) 
and (b). 

Coro.uary 8;3: Suppose that F is a group without elements of infinite order, 
that the group J is contained in a group J’ without elements of finite order such that 
D(J") exists and the pair F, J’ satisfies the conditions (a) and (b).!° Then F(A) is 
a direct summand of every group A such that F(A) = F and A/F(A) = J. 

This Corollary is a consequence of the Theorem 8;1 and of (4;4). 

Coro.uary 8;4: Suppose that F is a group without elements of infinite order, 
that J is a group without elements of finite order and that D(J) exists. Then 
C(p, F(A)) is a direct summand of every group A such that F(A) = F and 
A/F(A) = J if, and only tf, the condition (a) is satisfied for this particular prime 
number p. 

This is a consequence of the Theorem 8;1 and of (4;5), since for primary 
groups F the condition (b) becomes void. 

THEOREM 8;5: Let F be a group without elements of infinite order. Then F(A) 
is a direct summand of every group A such that F(A) = F if, and only 7f, 

(1) the orders of the elements of C(p, F)/p*C(p, F) are bounded for every prime 
number p; 
(2) C(p, F) = pC(p, F) for almost every prime number p. 

Proor: The necessity of the conditions is a consequence of the Corollary 
8;2, since we can choose as a group J the additive group of all the rational num- 
bers. That the conditions are sufficient, is a consequence of the Corollary 8;3, 
since by (3;1) every group J is contained in a complete group J’, and since 
complete groups are direct sums of coutable groups, i.e. D(J’) S 2. 

THEOREM 8;6: Suppose that J is a group without elements of finite order and 
that D(J) exists. Then-F(A) is a direct summand of every group A such that 
A/F(A) = J if, and only if, 





10 This implies that the pair of groups F, J satisfies also the conditions (a) and (b). 
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for every closed subgroup S of J of finite rank all the (closed) subgroups of rank 14 
of J/S are cyclic groups. 

Proor: It is easily inferred from Theorem 8;1 that F(A) is a direct summand 
of every group A such that A/F(A) = J if, and only if, J is neither almost 
pinfinite for any prime number p nor almost P-infinite for any infinite set P 
of prime numbers. But this condition is clearly equivalent with the condition 
of the Theorem. 

Note that by Corollary 8;2 the condition of the Theorem is also necessary if 
D(J) does not exist. 


Tue INSTITUTE FOR ADVANCED Srupy, 
Princeton, N. J. 





It can be proved that this condition together with the existence of D(J) imply that J 
isa direct sum of cyclic groups. But there exist groups which satisfy this condition and 
which are not direct sums of cyclic groups (e.g. the additive group of all the sequences of 
integers) and for these groups the problem whether the condition of the Theorem 8;6 is 
asufficient one is still unsettled. 
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ON EQUIVALENT SETS OF ELEMENTS IN A FREE GROUP 
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1. Let G be the free group generated by the symbols ay, --- , a, and their 
inverses'a;,---, a.,, and let (a) = (a1, --- , @m) be aset of elements of G which 
generate a sub-group [. In this paper we consider two kinds of equivalence 
between sets (a) and (8). ‘The first is equivalence under the automorphisms 
of I’, generated by ‘simple automorphisms’ of the form 


(1.1) a = aa’ or asa, a, = ay (p FX, hA~X, €=+1), 


together with the ‘simple automorphism’ which replaces a; by its inverse. 
Relative to this kind of equivalence we have very little to add to a paper by J. 
Nielsen,? in which he gives a mechanical process for deciding whether or no two 
sets are equivalent and also a process for reducing (a) to one of a finite number 
of normal forms. When reduced in this way we shall describe a set of elements 
as reduced (N), and we recall that (a) is reduced (N) if it contains no two words 
of the form (AB)* and (AC)# respectively,? where 1(A) > 1(B) or > IU(C), and 
if the last half of every word with an even number of letters is an ‘isolated end- 
ing.’ That is to say, if a = ABandl(A) = UB) no other word in (a) ends with 
B or begins with B’. 

In §2 it is assumed that any empty words which appear‘ during the process 
of reduction are discarded, as in J. N., while in §4 they are retained. 

Theorem 1 below is essentially a restatement of various arguments used by 
Nielsen, while Theorem 2 adds a detail to J. N. 

The second kind of equivalence refers to the effect on (a) of automorphisms’ 
of G, two ordered sets of elements (a) and (8), both of which contain the same 
number of words, being equivalent if a, corresponds to 6,(A = 1, 2, --- ) insome 
automorphism of G. That is to say they are equivalent if there is an auto- 
morphism 


, ’ 
a; — fa, a’) = fila, so* yp, Ay,°°° » Gy), 





1 As in a recent paper (J. H. C. Whitehead, Proc. L. M. S., 41 (1936), 48-56, which will 
be referred to as W. G., W’ will be used as an abbreviation for W—. 

* Mat. Tidsskrift, (1921, B), 77-94. This paper will be referred to as J. N. 

3 1(W) stands for the number of letters in W. Here we assume that each word in (a) is 
reduced (E£) (see the conclusion of this section). 

‘ This can happen after a transformation of the form (1-1), where ae = ay but p #2. 

>For a far reaching account of the group of automorphisms of @ see Nielsen, Math. 
Annalen, 91 (1924), 169-209. 
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such that 
By(a, a’) = al f(a, a’), f'(a, a’)). 


Nielsen® has shown that the automorphisms of G are generated by simple auto- 
morphisms of the form a; — aja;', a, — ax (j,k ¥ 7), together with a; > ae 
Thus the problem is to determine whether or no an ordered set (a) can be 
transformed into an ordered set (8) by a sequence of what I have called simple 
transformations,” among which we now include a; > a,. In §3 this problem is 
solved for finite sets. We adopt the convention followed in W. G. relative to 
true and cyclic words, a cyclic word being a sequence of letters in cyclic order, 
and in §3 any statement about a set of words will apply equally well to a set of 
true and to a set of cyclic words. Thus the equivalence problem is solved both 
for ordered sets of elements and for ordered sets of classes of elements. 

We shall follow H. Seifert and W. Threlfall® in using the symbol = to denote 
equality between words and = for equality between elements. Thus the true 
word abb’ = a but # a and the cyclic word bab’ = a. If W(a, a’) contains no 
syllable zz’ we shall describe it as ‘reduced (EZ)’ or as ‘reduced (E, a)’, and 
‘reduced (EZ, a)’ will have a similar meaning relative to F(a, a’), where a, = 
a,(a, a’). 


2. Let I be the sub-group of G generated by (a), where 
a, = a,(a, a’), 
and a, is reduced (#). Let 
W(a, a’) = F(a, a’) 
= Wa, a’), 


be any element in I’, where F is reduced (EZ, a) and W° is reduced (FE, a). We 
represent W by an oriented polygonal segment S(W), in a Euclidean plane, the 
oriented edges of which represent the letters of W in their correct order, a; 
pointing the same way as S(W) and a; pointing backwards. Whenever a sylla- 
ble rz’ is omitted during the reduction W — W? we identify the corresponding 
edges of S(W) in such a way that their common end point is on the left hand 
side of the new S(W), positively described. Finally S(W) will have been trans- 
formed into a figure S°(W), consisting of a segment S°(W°) with a row of trees 
T;,--., 7, attached to its left hand side, and in spelling out W each edge of 
S\(W°) is described once and each edge of 7’, twice, with opposite senses. The 





*Math. Annalen, 79 (1919), 269-72. 

'W.G.,§1. A simple transformation of a set of words (W), consists either of replacing 
« by (zy)*(y # x or x’) whenever x‘ occurs in (W), or of an elementary transformation, 
which consists of omitting a syllable zz’ (elementary reduction) or of inserting one (ele- 
mentary insertion). 

* Lehrbuch der Topologie, Leipzig (1934), chap. 12. 
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vertex in which 7, meets S°(W°) will be called its base, and the other extremities 
of 7’, will be called its leaves. It is to be understood that each tree has but a 
single edge incident with its base. Thus two or more trees may have a common 
base. By the length l(s), of any segment? s in S°(W) we shall mean the number 
of edges in s, and h, will stand for the longest segment joining a leaf of 7’, to its 
base. Weassume 7), --- , 7’, to be the order in which the trees occur in spelling 
out W and shall denote by J, the length of the segment, possibly zero, joining 
the base of 7’, to the base of 7,41, To and 7,4: being the first and last vertices of 
S°(W°). Finally, we shall describe 7’, as a stump if it has but a single leaf. 

THEOREM 1. If (a) is reduced (N), then each tree in S°(W) is a stump, no word 
in F(a, a’) ends at a vertex of T, other than a leaf, and 


all, = | hpi — hy| (ho = hoy = 0), 
b ll, > 0. 


This theorem, being vacuous if F = 1, will be proved by induction on [(F), 
measured in terms of the a’s. We first show that no word in F, except possibly 
the last, ends at a vertex P, of 7’,, which is not a leaf. For if it does, let 


F(a, a’) = Fy(a, a’) Fe(a, a’), 


where F, ends at P. If P is the base of T,, Jet L be any leaf, and if P is not the 
base let L be any leaf which is separated from the base by P. Then the branch 
consisting of all the edges of 7’, which are not separated from L by P is a tree 
belonging either to F, or to F2, say to F,. If 


Wi(a, a’) - Fi(a, a’), 


the figure S°(W,) has a tree growing out of its last vertex, contradicting the 
hypothesis 1, > h,, calculated for S°(W,). A similar contradiction arises if a 
branch above P belongs to F%. 

If S°(W) contains no trees there is nothing to prove. Otherwise let 


(2.1) 


F = a Fi(a, a’) = ay Wi(a, a’), 
where 
a, = AB, W, = B'U(a, a’), 


and the last letter in A is not the inverse of the first letter in U. From the 
preceding paragraph it follows that the last vertex of a{ either precedes the base 
of 7; or is a leaf of T;. In the first case the theorem follows immediately from 
the hypothesis of the induction. In the second case let af, be the second word 
in F. From the hypothesis of the induction it follows that every tree in S°(W;) 
is a stump, and if 7, were not a stump a would be completely absorbed by B 
and the beginning of the third word in F, contradicting the assumption that 





* By a segment we shall always mean a non-singular segment. 
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(a) is reduced (NV). Therefore T; is a stump, aj is of the form B’C, where 
\(B) < U(C), and T; spells out B, read from base to leaf. Since (a) is reduced 


(N) we have 
lb = (A) 2 UB) = h, 
Lh +h, = UC) = hy, 


and the remaining inequalities (2.1a) follow from the hypothesis of the induction, 
applied to S°(W1) 

If 1, = 0 the word starting at the leaf of 7, would finish at the leaf of 7... 
and would be absorbed into its two neighbours, contradicting the assumption 
that (a) is reduced (N). Therefore 1, > 0, and the theorem is established. 

Two important results'® are obvious consequences of this theorem, namely: 

If the set (a) ts reduced (N) it is independent," and: 

If (a) is reduced (N) and T = G each ais an a. 

It also follows” that 1(W°) = 1(é), where é is any at in 


F(a, a’) = Wa, a’). 


For let the segment representing ~ occur after the leaf of 7’, and before the 
leaf of T,41. Then l(t) S h, +1, + Apys, and from (2.la) we have 


bh+---+,hi12h,, hon Slat --- th. 


Therefore 
l(W°) == lo + eee + 1, r he tl, + Nour y 3 U(é). 


If (W°) = l(é), & being one of the longest words in F, it follows that l(é) = 
h, + l, + hows and that he + fy = Nos ifo < Pp, he = ke + Ross ifo> p. Let 
this be the case and let &, --- , & be the longest words in F, all of equal length, 
and let m,---+, m be the remaining words, if there are any, each being shorter 
than the ’s. I say that 


F= F,(n, n’)Fo(&, £’)Fe(n, 1’); 


where Fy = €© or e, t,. Forif F begins and ends with a £ the assertion follows 
from (2.1b) and the fact that 


ifq > 0, while «, = — 1 and e = Lif 
Fe < 


ee 
J. N., theorem 1, and Math. Annalen, 79 (1919), 269-72. 

“ This means that W(a, a’) ¥ 1, where W(a, a’) = F(a, a’), if F # 1 and F is reduced 
’ a). 
*J.N., p. 91. 
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since the last half of each é is an isolated ending. If F begins or ends with an n° 
we have lk = h; or 1, = h, and, removing the extreme 7‘, the assertion follows 
from induction on the number of words in F. It also follows that each 7° in F; 
is an 7’ and each 7 in F? is an 7, since the last half of any 7 with an even number 
of letters is an isolated ending. 

By a level transformation we shall mean either a transformation of the form 
(a) — (a*), where (a*) is any independent set of elements among Q1, > ++, Om, 
a,,-+:, a, ora transformation of the form (1.1) such that I(a}) = Ue). Thus 
a, = aa, is a level transformation if, and only if, a, = AB, a, = B’C, where 
1(B) = U(C) and AC is reduced (EZ, a). Notice that the inverse of a level trans- 
formation is a level transformation. We shall describe (a) as a minimal set if 


Ke = = Kon) 


is a minimum among all sets of generators for [T. From the rule given by 
Nielsen for reduction (N) it is clear that the reduction of any minimal set to 
one which is reduced (N) consists of a sequence of level transformations, and 
from J. N., theorem 6, it follows that any reduced system (N) is minimal. 

THEOREM 2. Any two minimal sets of generators for T are interchangeable by 
level transformations. 

Let (a) and (8) be any two minimal sets of generators for T. From the 
preceding paragraph it follows that no generality is lost in assuming (a) and (8) 
to be reduced (N). Then they both contain the same number of elements, 
say m, and, assuming them to be ordered so that 


U(ay) Ss U(ay), U(B,) S 1(B,) 

if \ < yp, we have ® 
U(ay) = (By) = ky (A = 1,---,m). 
If m = 0 the theorem is vacuous, and we shall prove it by induction on m. 
Let k, be the first of k,, --- , km whichis thesameask,. Ifs > landA<s <p 
we have 1(8,) < Ua,), Wax) < U(B,) and it follows from a previous result that 
(81, --» , Bs-1) depend only on (ay, «++ , @—1) and vice versa. Any sub-set of a 
minimal set is obviously minimal and if s > 1 it follows from the hypothesis 


of the induction that (6, --- , Bs—1) — (a1, «++ , @-1) by a sequence of level 
transformations. Moreover, it follows from a previous argument that 


(a1, *++ 5 sr, Bs, -°- » Bm) —> (a, st y Hel, Ya, *** » Ym) 
by level transformations, where 


/ 
VA = @,, OFr ay, Ae, (s SA, pr,rnFZ m). 





13 J.N., p. 91. 
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If s = 1 it follows from the same argument that (8) = (y) is related to (a) 
in this Way. 

If each y is an a‘ a further level transformation (y) — (a,, --- , am) carries 
(aj, ++) @s—-1y Ya) *** y Ym) into (a). Otherwise we can arrange the notation 
so that 


Y = a (A =2s,---,2@) 
’ ; ‘ 


Yu =a Ae (u=t+1,---,m), 


Pu pb 


with the convention ¢ = s — lifnoyisanat. If p, > t,o, > tfor every value 
of u the determinant of the linear transformation 


Tr 
=a —-a@% =-—-a a 
Yu un Pu yt “a? 


calculated to the modulus 2, would be zero. Therefore some y is an a, and 
», < toro, < t for at least one value of u, say Yiu1 = a,az. Theno >t, or 
the set (y) would not be independent, and we may suppose o =t + 1. Apply- 
ing the level transformation 


* 
Tea TeTur. ™ Mas 


to (y), the theorem follows from a second induction on m — t. 

It is a straightforward matter to write down all the sets obtainable from a 
given minimal set (@°), by sequences of level transformations. For let (a'), --- , 
(a) be all the sets obtained from (a°) by single level transformations. Omitting 
those which already appear in the sequence, let (a“'**), --- , (a“*) be the totality 
of sets obtained from each of (a!), - - -, (a) by single level transformations, 
and let (a*t"), ... , (a**+) be similarly obtained from (a**-'*"), ... , (a**) 
(s= 1,2, -..). This process will come to an end after a number of steps which 
is bounded above by (2p)*‘*”, since this number exceeds the number of sets 
(), for which L(¢) = L(a@®). If (£) is any set derived from (a) by a sequence 
of level transformations let (¢¢+!) be derived from (£?) by a level transformation, 
where p = 0, --- , N, (&) = (a), (&”) = (&), and" (é*) # (&) if p #0. Then 
it follows from induction on p that (£) is one of (a*-),---, (a‘*), where 
y= 1. Therefore the sequence («°), (a'), --- contains every set which can be 
obtained from (e°) by a sequence of level transformations. Thus a reduced 
set (8) will be contained in this sequence if it is equivalent to (@°). 

‘In general’ no level transformation of the form (1.1) is applicable to a re- 
duced set (a). In such a case we have only to compare (8, 6’) with the various 
permutations of (a, a’) to discover whether or not (a) is equivalent to a given 
reduced set (8). 

Following the analogy with linear dependence, one might enquire whether or 


a ee 


‘Tf (g?) = (#") where o > p we should simply omit (£*1), --- , (&) from the sequence. 
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no it is always possible to select an independent basis for T from a basis between 
whose elements there are one or more identical relations. The example 


a = a’ba'b’, B = Bab’a’, y = aba’b’ 


shows that the answer is in the negative. For a = ay’, b = By, but no two of 
(a, B, y) generate the free group G(a, 6). 


3. We now consider the equivalence of ordered sets of elements under the 
automorphisms of G. A simple transformation, operating on a set of words 
(W), true or cyclic, will mean the same as in §1 of W. G., referred to in §1 above, 
except that we shall include a, — ay among the simple transformations. By a 
transformation of type T, or a T-transformation, we shall mean a transformation 


of the form” 
(3.1) aj? > a;P a, (k fixed, p = 1, ---, rm, 1%, #h), 
followed by what we shall call a complete elementary reduction, meaning a trans- 
formation by elementary reductions to words which are reduced (£). Here 
it is to be understood that a; ? ~ aj? if p#a, but we allow 1, = t, €& = —6, 
in which case 

aj, —> Q;, Aj, 4; . 

If (W) is reduced (£) and if (W*) is obtained from (W) by a T-transformation 

we shall describe the latter as a reduction (T), or as a T-reduction, if 

L(W*) < L(W), 
and as a level transformation if 

L(W*) = L(W). 
We shall also describe an automorphism of the form" 

a;— a5? 


as a level transformation. A set to which no T-reduction is applicable will be 
described as minimal, or as minimal (7’) if confusion with the minimal sets of §2 
is to be feared. The latter may then be described as minimal (N). The total 
number of T-transformations is finite and one can find out if a given one, applied 
to a given set, is a reduction or a level transformation or neither. Also the sets 
(W*), such that (W) — (W*) by level transformations, can be written down by 
a process similar to the one described above. 

We shall also introduce a new use of the term elementary transformation. 
By an elementary transformation we shall now mean any one of the following: 





16 Cf. W.G., §2. 
16 Such a transformation is a product of simple transformations (Nielsen, Math, An- 
nalen, 79 (1919), 269-72). 
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1. The identical transformation, 
2. An elementary reduction or insertion, these being defined as usual, 
3. A transformation of the form" 


Wy, 2'Wyx (A = 1, .--.,m), 


where W, may be true words provided z is the same for each value of \, 

4. The resultant of any sequence of these. 

It appears that the equivalence test is analogous to the one described at the 
end of §2. If we reduce (W) to a minimal set (W°) by a random sequence of 
T-reductions, all the minimal sets which are equivalent to (W) can be obtained 
from (W°) by level transformations. 

To prove this we shall use the model for (W) described in W. G. §3, but will 
denote the 2-spheres in M by 2; instead of T;. Let 2}, --- , 2° be any set of 
oriented, non-intersecting surfaces in M, and let w, --- , wm be any set of ori- 
ented circuits'® in M which, with the 2-spheres 2, represent the words (W). 
Then the surfaces 2°, together with the circuits w, represent a set of words (W°), 
a positive intersection >$-w, being marked a;, and a negative intersection 
>'.w, being marked a.. Thus the substitution 2, 2!, which consists of re- 
placing 2; by 2? in the machine for spelling out words, determines a trans- 
formation W, > W. 

Now let the surfaces 2}, --- , 2? be non-singular 2-spheres, no linear combina- 
tion of which bounds, and let S}, --- , S? be the connected surfaces in the model 
R, obtained by cutting M along >, --- , 2, (ef. W. G. §3), which correspond 
to regions on the 2-spheres 2°, the surfaces S° being bounded, in general, by 
circuits on the 2-spheres C; and C;. Using a method due to J. Singer," as 
adapted in W. G. (p. 54), we shall prove two lemmas, of which the first is: 

lemma 1. The 2-spheres 2° are similar to the 2-spheres = in that M is trans- 
formed into a punctured sphere? R°, by cutting along 2}, --- , =}. 

To prove this we modify Singer’s argument as follows. We replace his 
diagram A’ by our model R, his circuits e by our 2-spheres 2° and his circuits e’ 
by our 2-spheres 2. We replace the points P; and Ps of the third paragraph 
on his p. 106 by a circuit belonging to some intersection D{-2, which bounds a 
element on Z? containing no other circuit belonging to an intersection 2{ -2. 
We also define moves analogous to his IA, IB and IC. A move of type IA 
changes the orientation of a 2-sphere = and determines a simple transformation 
ofthe form a;—>a;. Let C* be a 2-sphere in R which either 





On p. 50 of W. G. it is shown that such a transformation is the product of simple trans- 
formations. 

'S As in W. G., all the chains referred to will have general intersections, relative to slight 
deformations, unless the contrary is stated. 

" Trans. American Math. Soc., 35 (1933), pp. 105-6. 

** By a punctured sphere is meant a bounded manifold which is equivalent to a 3-sphere 
from which a (finite) number of 3-elements have been removed, no two of the latter having 
4 common vertex. 
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1. separates some C§(= C;, or C’.) and no other 2-sphere C{ from C;‘, or 

2. separates C$/ and precisely one other 2-sphere Cj‘, from Cj“. 

We cut R along C* and re-attach the two regions so formed by uniting corre- 
sponding vertices of C; and C’;._ This is a move of type IB in the first case and 
of type IC in the second. 

Let U be the region, bounded by C* and C§, which is detached from R in a 
move of type IB, and let C*’ be the side of C* incident with U. Let U* be the 
region into which U is transformed by uniting C; with C ‘, let Ri = R—U, 
R* = R, + U* and let E be a 3-element bounded by C*’ but having no internal 
component in common with R. Then R, and U are punctured spheres, 
U* + E is a 3-element, Ri + (U* + £) is a punctured sphere and R* = 
(Ri + U* + E) — Eis a punctured sphere. Thus a move of type IB trans- 
forms R into another punctured sphere. A similar argument shows that a move 
of type IC also transforms R into another punctured sphere, and Singer’s argu- 
ment shows that one can, by a series of these moves, transform R into a punc- 
tured sphere R°, whose boundary 2-spheres correspond in pairs to the 2-spheres 
>°. Thus lemma 1 is established. 

The second lemma is: 

Lemma 2. The set of words (W°) is equivalent to the set (W). 

First let R° = R, that is to say let each 2° be a 2. Then we can transform 
(W) into (W°) by a change of notation 


€ >; 
a;—> @;'. 


Now let & be transformed into R° by a single move of type IB or type IC, and 
first let this be of type IB. Rearranging the notation, if necessary, let 2} = 2, 
if? #7. Then S$, the image of D$ in R, is the 2-sphere C*, and we may assume 
that S§ separates C;, rather than C 4 from the rest of the 2-spheres C;, and that 
{is oriented so that the positive side of S} is the one remote from C;. Then 
the transformation 2; > 2}, with due regard to orientation, can be regarded 
as the result of an isotopic deformation of 2;into 2}. Such a deformation is the 
resultant of deformations of the form 


ZT—T+EF (mod. 2), 


where E is the boundary of a 3-element E, which meets = in a 2-element f, on 
E, and nowhere else. After a suitable subdivision and slight deformation of the 
circuits w, the intersections 2.w will be general after each stage of the def- 
ormation 2;—> 29. Thus if (W) consists of cyclic words, or if (W) consists of 
true words and none of the 3-elements E contains O, the deformation 2; — 2; 
will determine an elementary transformation of (W), since the intersections 
2-w appear and disappear in consecutive pairs, those in each pair differing in 
sign. If (W) consists of true words and if some E contains O we modify the 





*1 The transformation from U to U* consists of replacing the vertices of C$ by the cor- 
responding vertices of Cs ' 
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deformation as follows. Let E° be a 3-element in Z whose boundary meets £ 
in a 2-element contained in f and nowhere else, and such that E° meets each 
segment of w issuing from O in a single vertex, and otherwise does not meet w. 
For example, after a suitable subdivision of Z, we may take 


E* = N(s, E), 


where s is a segment joining O to f and N(L, K) stands for the aggregate of 
components in a complex K which meet L. Then we take the deformation 


Z—-rT+H# 
in two stages, namely 
PH>lT+ Hh 
and 
+H 524 + (E+ B. 
The first stage determines an elementary transformation of the form 
W, — a5W,a;* 


and the second stage also determines an elementary transformation. Thus in all 
cases 


~—2+8, 
and hence the complete deformation. 
2; 25, 


determines an elementary transformation of the words (W). 

Now let R be transformed into R° by a single move of type IC. As before 
let S} cover C; positively and let the detached region be bounded by S}, C; and 
C;. Let « stand for the move R — R° and let u* be a move of type IC in which 
Ris cut along a 2-sphere C*, separating C; and C; from S$. Then uz is the re- 


sultant of u*, followed by a move of type IB. After a suitable subdivision of 
R let 


c* = N(C;+ 8+ Cj, R), 


where s is a segment joining C; to C; in the region bounded by C;, C; and 3}, 
and the positive side of C* is the one remote from C;. According to an argu- 
ment in W. G. (p. 54) the move u* determines the simple transformation a; — 
4,4; of the words (W), C* being eventually renamed C;. Therefore » determines 
the resultant of the simple transformation a; — a,a; followed by an elementary 
transformation. 

In the general case the lemma now follows from induction on the number of 
moves in the transformation R — R°. 
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Let n; be the number of times a; and a; occur in (W). Then the circuits w 
pierce 2, in n; vertices, and we shall prove the lemma: ; 

Lemma 3. If (W) is a minimal set (T) and if C* is a 2-sphere in R which 
separates C, from Ci, then 


n* z Nk y 


where the circuits w pierce =*, the image of C*, in n* vertices. 

Let C* separate Cj, and also C§!, --- , C{" say, from Cz‘, with the convention 
that r = Oif C*isisotopictoC;. Ifr = 0 the transformation of (W) determined 
by = —  * is an elementary transformation. Otherwise let s, be a segment 
in the region bounded by C;, C{!, --- , C§" and C%, joining C; to C§ e, and, after 
a suitable sub-division, let 


C+ =NCL+ C8 +---4+CR +at--- +8, R), 


oriented so that the positive side of C** is the one remote from C;,. Let 2** 
be the image of C** in M. As in the case of a single move of type IC (i.e. the 
case r = 1), it follows, from an obvious extension of the argument in W. G., that 
the transformation of (W) determined by 2 — >** is the transformation (3.1). 
Therefore the transformation determined by = — =* is (3.1) followed by an 
elementary transformation. 

In either case, let 


zi = 2, if «x k, >; = 2°, 


and let (W°) be the set of words determined by the circuits w and the 2-spheres 
>*°. Then 


L(W°) = L(W) + n* — nm, 


and we have only to show that L(W) < L(W’°). 

If r = 0, L(W) s L(W°) since (W) — (W°) is an elementary transformation 
and (W) is reduced (Z). If r > 0 let (W*) be the set of words derived from 
(W) by the T-transformation consisting of (3.1) followed by a complete ele- 
mentary reduction. Then L(W*) < L(W°), since (W®) is derived from (W) 
by (3.1) followed by an elementary transformation which may or may not be a 
complete elementary reduction. Since (W) is minimal L(W) < L(W*). There- 
fore in either case 

L(W) s L(W°®) 
and the lemma is established. 

Let (W) and (W°) be two sets of words represented by the intersections of a 
set of circuits w® with sets of independent 2-spheres = and 2°. In W. G. (p. 53) 
I described a method for replacing 2° by a set of surfaces having the same inter- 
sections with w® and having normal intersections with the 2-spheres 2. That 
is to say each of the new surfaces S° meets a given one of the 2-spheres C; ina 
single circuit, if at all. Here we shall need the result stated in the lemma: 

Lemma 4. The circuits w° can be replaced by a set of circuits whose intersections 





W 
ag 


ar 


th 


is 
tex 
ing 
car 
an} 


ane 


bor 
stit 
For 
inte 
moj 
st 
- : 

22 
and. 


24 


more 








i 
nh 


S 





ON EQUIVALENT SETS OF ELEMENTS IN A FREE GROUP 793 


with = also represent the words (W) and, at the same time, =° can be re placed by a 
set of independent 2-spheres having normal intersections with >, whose intersections 
with the new circuits represent the words (W°). 

Unless the intersections 2-=° are already normal there is a surface S$ such 
that two circuits of 83 are connected by a segment pq, on some C{, whic h does 
not meet any other circuit belonging to an” 8°. Let E, and E2(E2, D q) be the 
2-elements on C{ which are bounded by these circuits of S$ and which do not 
contain pg. After a suitable sub-division let E> be the 2-element on the bound- 
ary of N(E2, R) which is inside R and is bounded by the circuit on the boundary 
of N(Bs, S3) which is inside R, and let pg* be a segment on S9 joining p toa 
vertex g* on E3. Let T be the solid tube in R bounded by 


E, + E; + N(E,, S$). 


We shall replace 7 by a 3-element 7*, defined as follows. Let R be sub-divided 
again in such a way that 


F = N(E», T) and F* = N(E%, T) 


are 3-elements which do not meet each other. Let ¢ consist of all the solids in 
R which meet gp + pq* and lie on the same side® of S$ as pg. Then we assume 
the second sub-division of R to be such that 


T* =F +it+ F* 


isa3-element. We map 7 semi-linearly on 7* in such a way that each ver- 
tex of E, and E> corresponds to itself. If one of the segments in R correspond- 
ing to asegment of some w pierces E; it will also pierce E3 without leaving 7, be- 
cause of our understanding that all intersections are general. Similarly 
any portion of an S° which lies inside 7 is a cylinder bounded by circuits in 
F, and E;. We replace these segments and cylinders™ by their images in 7%, 
and Ss by 
Ssot+T+T*, 
which is a surface meeting Cf in the same circuits as Sg and also in a strip 
bordering pq, since addition is mod. 2. The corresponding alteration in M sub- 
stitutes other 2-spheres for any of the 2-spheres 2° which have been modified. 
For we have not altered the topology of these surfaces by substituting cylinders 
internal to 7* for cylinders internal to 7. In altering T itself we have merely re- 
moved the cylinder bounded by E, and £3, deformed £3 by removing a strip on 
85 s bordering pq* and then restored a cylinder bounded by £, and the image of 
E} in the deformation. Since all the chains have general intersections neither the 
_—-_e-— 
2W.G., p. 53. 


** The surfaces S° are two-sided, since they correspond to regions on the 2-spheres 2°, 
and EF, and E* lie on the same side of S, , pg and ¢ lying on the other side. 


* Though addition i is mod. 2 we shall sometimes use the minus sign to make the formulae 
more explicit. 
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first cylinder nor the strip bordering pq* meets any of the segments corresponding 
to the circuits w. Any segments which previously ran through T are now re- 
placed by segments in 7* with the same end points. Thus the circuits w and 
the 2-spheres 2° are replaced by new sets of circuits and 2-spheres, say w* and 
>*, such that the intersections 2-w* and =*-w* are the same as Y-w and >°.w 
respectively. 

Any one of the cylinders on a surface S°® which ran through T, together with 
2-elements on E, and E3, bounded a 3-element 7), in T, and the cylinder in 
T* which replaces it, together with the same 2-elements, bounds a 3-element 
Tx, in T*. Let T = T, y; = 7* and let 7, and T) be the respective images 
of T, and Ty in M(\ = 0,---,n). Then either 


=>? 
or 
B= Tet ttt-:- +F +h. 
Therefore 


DT~ d (kK =1,---,p), 


and the 2-spheres =* are independent. 

The circuits £, and £, may now be united by a slight deformation which car- 
ries the strip bordering pq through C { into the neighbourhood of C7‘, and the 
lemma follows from induction on the total number of circuits in the boundaries” 
_ ae 8° ; 

As a final lemma we prove the converse of lemma 2, restricting ourselves 
to the case where (W) and (W°) are reduced (£). 

Lemma 5. Any two equivalent sets of words (W°) and (W), both reduced (E), 
can be simultaneously represented by the intersections of a single set of circuits w, 
with sets of independent 2-spheres 2° and 2. 

Let (W°) and (W) be equivalent sets of words, both reduced (EZ), and let (W°) 
be represented in M by the intersections of a set of circuits w*, with a set of 
independent 2-spheres 2°. From an argument in W. G., §1, it follows that the 
transformation (W°) — (W) can be exhibited in the form 


(W) = Ut, --- i(W°), 


where ¢, is a simple transformation of the form z — zy, not followed by any 
elementary reductions, and U consists only of elementary reductions, (W) being 
reduced (£), and transformations of the type x — x’. As in W.G. (p. 54), it 
follows from induction on yu that 


(W*) = t,--- h(W°) 





** From the existence of the original segment pg, it follows that the relation ‘having 
normal intersection with’ is symmetric between > and =°. 
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is represented in M by the intersections of the original circuits w* with a set of 
independent 2-spheres 2. In the presence of lemma 4 we lose no generality in 
assuming the intersections 2-2° to be normal, which we accordingly do. 

As in W. G. (p. 54), where (W°) is an independent set of letters, we have to 
show that the reduction (W*) — (W) can be copied by a deformation of the cir- 
cuits w* into a set whose intersections with 2° and 2 represent (W°) and (W) 
respectively. Let PQ be the segment in R corresponding to a syllable xz’ in 
(W*), say a;a;. Since the intersections =°.= are normal, P and Q are joined 
by a segment s, on C;, which does not meet any one S$ more than once, and PQ 
can be deformed” into sin R. As we copy this deformation in M the intersections 
w*.2° may be modified. If so it follows from an argument similar to one used in 
proving lemma 2 that the new intersections w*-2° will represent a set of words 
which are derived from (W°) by elementary reductions and insertions. But an 
argument used in W. G. shows that the number of intersections s-S° does not 
exceed the number of intersections PQ-S°. That is to say, the number of inter- 
sections w*-2°, and therefore the number of letters in (W®), is not increased. 
Since (W°) was originally reduced (£) it follows that the set of circuits w* con- 
tinues to represent (W°) after the deformation. As in W.G., the elementary 
reduction which consists of omitting a,a, may now be copied in R by deforming 
s through C; into the neighbourhood of C;. Repeating this process, w* may be 
replaced by a set of circuits w, whose intersections with 2° and = represent (W°) 
and (W) respectively. Thus the lemma is established. 

We are now ready to prove: 

THEOREM 3. Any two equivalent minimal sets (T’) are interchangeable by level 
T-transformations. 

The inverse of a T-transformation, applied to a set which is reduced (£), 
isa T-transformation. Therefore the inverse of a level transformation is a level 
transformation and it is enough to show that two equivalent minimal sets (W°) 
and (W) can be identified by level transformations applied indiscriminately to 
either or both of them. 

Let (W°) and (W) be represented in M by the intersections of w with inde- 
pendent sets of 2-spheres 2° and 2. First let us assume that each 2° either 
coincides with some 2 or does not meet any 2. Let the notation be such that 


Bi, «++, 
do not meet any 2, while 
= =2,ifp>k 


(possibly k = p, in which case no Z° meets a Z). If k is zero (W) = (W®), and 


we shall use induction on k. 
We have 


De 152i (summed for j = 1, --- , p), 


. 





*® We need not concern ourselves with the fact that PQ may cross itself or other segments 
of 9(w*) during this deformation. 
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where r,; are calculated mod. 2. Also | 7,¢| = 1 (p, o = 1, --- , k), since the 
>"’s are independent. Rearranging the notation, if necessary, we may suppose 
ri = +++ = Te = 1. Then S°, the image of 2) in R, separates C, from ¢ 


and since (W) is a minimal set it follows from lemma 3 that 


0 
nN, = No» 


where n? is the number of intersections >!-w. Since n} = n, if i > k we have 
LWW’) =ni+---+n,2n+--- +n, = LW). 
Similarly 
L(W) 2 L(W°) 
and it follows that 
L(W) = L(W). 


Therefore?’ n? = n,. We have seen that the transformation of the words (W) 
determined by the substitution 2, — 2? can be exhibited either as an elementary 
transformation U, or as a transformation of the form (3.1) followed by an ele- 
mentary transformation U. In the first case U is the identity, since n} = n, 
and the words (W) are reduced (EZ). Inthe second case, U is a complete elemen- 
tary reduction since the set (W) is minimal. Otherwise the T-transformation 
determined by (3.1) would be a T-reduction, since n?, = n, and any elementary 
reduction applied after U would reduce the number of letters in the new words 
(W). Therefore 2, — >} is a T-transformation. Therefore it is a level trans- 
formation and this special case of the theorem follows from induction on k. 

Returning to our general rule that all intersections are to be general relative 
to slight deformations, we shall complete the proof by induction on the number 
of circuits in which 2 and 2° intersect. The process of normalization used in 
proving lemma 4 reduces the number of circuits in the intersections 2- 2”. 
Therefore we may assume the latter to be normal. 


Let S3 be any bounded S° and let 
8s=ce+tet+--- +e, 
where c is on C{ and ¢, is on C5? (possibly C; °° = C;* for some p). Let E be 


one of the 2-elements into which C§ is separated by c, let E ; be one of the 2-ele- 
ments into which Ci? is separated by c, and let 


Fm S$+ El + +20. 


Let M(K) stand for the image in M of any complex K in R, let 2° be the >’ 
containing M(S3) and let E° be the 2-element on $ which is bounded by M(c) 
and contains M(S3). Using induction on the number of circuits belonging to 





27 Cf. J. N., p. 91. 
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y.>° which are inside E°, we shall define the proper one of the two 2-elements 
into which C§ is separated by c. If 


(3.2) M(C; —-E+F)~xuX, M(E+ F)~ yr: 


(summed over t), we have z; + y; = 1, the 7 being the same as in C{. If E° 
does not meet any 2 except in its boundary, 83 consists of a single circuit. 
Otherwise we choose E : on Cr"? such that its image E,, on C$ 2, is the proper 
2-element bounded by c,, the image of c, . Ineither case the proper 2-element 
E, on C§, is to be the one such that the y; of (3.2) is zero and z; = 1. 

If k is the number of intersections M(E)-w, E being the proper 2-element of 
the last paragraph, and if k° is the number of intersections E°.w, I say that 


ks k*. 


For let k, be the number of intersections M(E,)-w and k? the number of inter- 
sections EZ? .w, where E° is the 2-element in Z° bounded by M(c,). Then 


(3.3) W=P+ki+--- +k, 
where s° is the number of intersections M(S3)-w. Since 
M(C; -E+F)~2%i+--- 


it follows that the 2-sphere C{ — E + F, after a slight deformation in R, separates 
C; from C e The number of intersections M(C§ — E+ F)-wisn; —k + s°+ 
ki + --- + k,, and it follows from lemma 3 that 


(3.4) KS 8 +h+.-.-- +h. 


If 8} consists of the single circuit c, k, = 0 and k® = s°. Otherwise, assuming 
k, Sk’, the required result follows from (3.4), (3.3) and induction on the 
number of circuits of 2-2 inside E°. 

Let 2; be one of the 2-spheres 2 which meets a 2-sphere in the set 2°. Let 
cbe an ‘extreme’ circuit belonging to 2,;-2°, meaning that one of the 2-elements 
into which 2; is separated by c contains no other circuit belonging to 2;->°. 
Let A be the 2-element in question and let E° be the 2-element bounded by c on 
2}, the 2° which contains c, such that z; = 1 in the homology 


To - EB +A~z,d}. 


Now cut M along the 2-spheres 2° to form the model R°, referred to in lemma 1. 
The image in R° of 2° — E° + A, slightly deformed, is a 2-sphere which separates 
the two images of 29 from each other. The number of intersections (23 — E° + 
A)-w is n$ — k®° + 1, where k® is the number of intersections E°.w and 1 is the 
number of intersections A-w. Therefore 


io, 


according to lemma 3. If 1 = k® it follows from a previous argument that the 
substitution = > >° — E° + A determines a level transformation of (W®). 
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Since it reduces the number of intersections 2-Z° the theorem, in this case, 
follows from the hypothesis of the induction. If 1 > k° let C{ be the image in 
R of the side of 2; incident with H°. If R(A), the image of A in R, were the 
proper 2-element bounded on C{ by R(c) we should have! S k°. Therefore 


E = Cj — R(A) 


is the proper 2-element, and if k is the number of intersections M(E) -w we have 
k < k® <1. That is to say, more than half the intersections 2 ;-w lie in A unless 
the substitution 2 — 2} — E° + A determines a level transformation of (W°). 

But the 2-element 2; — A also contains a 2-element A, which is bounded 
by a circuit of ;-5° and contains no other circuit of Z;-2°. Adding suitable 
subscripts to indicate that A is replaced by A we see that one or other of the 
substitutions 


Yo 25 —- +A, T3235, - Fi+A, | 


determines a level transformation of (W°). They both reduce the number of 
circuits in >-D°, and the theorem is established. 

From this theorem it is clear that the process described earlier in this section 
will exhibit the equivalence of equivalent sets. As before, let n; be the number 
of times a, and a, appear in (W) and let g be the number of n’s which are not zero. 
Then L(W), q and n; are each unaltered by level T-transformations and it follows 
that these numbers, calculated for a minimal (7'’) set (W), are invariants of the 
family of equivalent sets. They are analogous to L(a), m and k, = Ua), 
calculated for a minimal (N) set (a). 


4. In this section we indicate how the results of §§2 and 3 can be applied 
to the study of free factor groups of free groups. Let I be the free group 
generated by a;, --- , @m and let y be the factor group I'/I'y, where Ip is the 
smallest invariant sub-group containing 


Ri(a, a’), et ag R,(a, a’). 


Let a, — a, (a, a’) define a simple isomorphism between y and the free group 
G, generated by a1, ---,@,. Then 


(4.1) R,[a(a, a’), a’(a, a’)] = 1, 
an element F(a, a’), of I, belongs to Tp if 

Fla(a, a’), a’(a, a’)] = 1, 
and there are elements F; (a, a’) such that 


a; = F,[a(a, a’), a’(a, a’). 
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Se, Therefore a (a, a’), --- , @m (a, a’) constitute a basis for G, and there is a basis 
in for T, say Bi (a, a’), --- , Bm (a, a’), such that the set consisting of the corre- 
he sponding elements in G, namely 


By (a, a’) = By [a(a, a’), a’(a, a’)], 


is reduced (N). Re-arranging the notation, if necessary, it follows from theo- 


rem 1 that 
ve 5 : 
eSs (4.2) B; (a, a’) = a; @=1,---,p), 
es . 8, (a,a’) = 1 (op =pt+1,---,m), 
e 
ble m exceeding” p unless the relations R, = 1 are vacuous. Then y is the sub- 
he group freely generated by #1, --- , 8,, and the relations 
(4.3) S; (6, 6’) = --- = S, (6, 6’) = 1 
are equivalent to 
of (4.4) Bou = pe = B,, = a 
, where 
on 
ber &, (B, B’) _ R, [a(8, 8’), a’(B, 6’)). 
a In particular, let y be defined by a single relation 
the R(a, a’) = 1, 
x»); where R ¥ 1 when the word R is reduced (E, a). With the notation used in the 
last paragraph we see by considering the free Abelian group associated with I 
‘ed that p = m — 1, since p < m and the relations (4.4) are equivalent to (4.3), 
“- with g = 1. Moreover it follows from the equivalence of (4.3) and (4.4) that” 
the S(6, B) = Bs, 
when the cyclic word S is reduced (E, 8). That is to say the cyclic word R(a, a’) 
is simple. 
Conversely, if R is a simple word it is obvious that y is a free group, and we 
sup have the theorem: 
THeorem 4. The group generated by a1, --- , am, subject to a single relation 
R(a, a’) = 1, 
which is not vacuous, is a free group if, and only if, the cyclic word R(a, a’) is 
simple. 





*Cf. F. Levi, Math. Zeit., 37 (1933), 90-97. 
* W. Magnus, Journal fiir die r.u.a. Math. 163 (1930), 142. 
See also K. Reidemeister, Einfiihrung in die kombinatorische Topologie, Brunswick 
(1932), p. 92. 
*°W.G., §1. 
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It follows from this theorem that one can tell whether or no such a group is a 
free group by the process of reduction (7). 


5. A problem which remains open is to determine whether or no I is equiva- 
lent to '* under the automorphisms of G, where [ and I* are sub-groups of the 
latter, generated by given sets of elements (a) and (a*). If I and I*, generated 
by finite sets (a) and (a*), are equivalent, one might hope to identify them by 
sequences of reductions (N) and (7), applied to (a) and (a*), followed by level 
transformations. The following example shows that one cannot always do this 

Let 


a, = (ab)? b-*(ab)? a’, A, = at, 
and 
a = ab ab, Be = (ab)-* b-. 


Then (a;, 6:1) and (a2, Be) are both unassailable by reductions of either kind. 
Since L(a,, 61) = 21 and L(ae, Be) = 22 they are not interchangeable by level 
transformations. All the same, if 


a = a’b-a?(ab)3, B = (ab) b>, 
(a;, 81) — (a, 8) in the automorphism (a — ab, b > b), while (a2, B2) = (a8, 8) 
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ON THE COMPLETENESS OF LAMBERT FUNCTIONS, II! 


By Einar HIE anv Orto Szisz 


(Received April 27, 1936) 


1, Introduction. In an earlier note by the same title? the authors have in- 
vestigated completeness properties of systems of functions of the form S(a, 8, \): 


(1.1) (1 — t)*kg(t*), 

wheren = 1,2,3,---,0<t¢< l,and 

(1.2) ke(w) = >> mow. 
1 


Their results could be summed up in the following 

THEoREM 1. Let R(An) 25> 0, DoT R(1/An) = +. The system S(a, B, d) 
is complete in L,(0, 1), p fixed, 1 S p < ~,tfa > max (6 + 1 — 1/p, — 1/p), 
and 1 + S(a, B, \) ts complete in C(O, 1] if a > max (6 + 1, 0). 

These results admit of various generalizations which will be given in the 
present note. A set of elements is said to be complete in a (Banach) space if the 
only linear functional defined all over the space which vanishes for all elements 
of the set is the zero functional. Using this definition we associated with the 
given set S a certain analytic function F(z), holomorphic in the right half plane. 
If the set is incomplete in the space in question, it is possible to choose F(z) so 
that it vanishes at z = X, for all n; the generalized theorem of Blaschke being 
applicable, F(z) vanishes identically. On the other hand, F(z) turned out to be 
expressible in the form 


F(z) = >> m® L(mz), 


where L(z) is a Laplace transform, and conversely L(z) was expressible linearly 
in terms of F(nz) with the aid of the special inversion formula of Mdébius, the 
inversion being unique within the class of functions appropriate to the problem. 
Hence F(z) = 0 implies L(z) = 0, and from this we could conclude the com- 
pleteness by easy steps. 

The same general method applies without essential change to much more 
general systems. Thus in §2 below we replace (1 — ¢)* by an arbitrary weight 
function g(t). In §3 we replace in addition the coefficients m® in the generating 
function kg(w) by arbitrary complex numbers a», such that a; = 1, | @m | S Bm*. 
This change is of more fundamental nature than one would think a priori, be- 
/Stltnsintapiieemiiies 

' Presented to the American Mathematical Society April 10, 1936. 

* Bulletin Amer. Math. Soc., 42 (1936), 411-418. 
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cause we have now to resort to the general form of the inversion formula of 
Mobius, and the restrictions on the parameters have to be made more severe 
than in the special case. In §4 we take one further step by replacing the gen- 
erating power series by an irregular power series whose exponents form a multi- 
plicative system. On the other hand, the mere replacement of the exponents 
{m} and the coefficients {m*} by other multiplicative systems does not call for 
any other tools than the special inversion formula. In §5 we discuss the com- 
pleteness in C[0, 1] of the system 1 + ba + 1, 8, \) which requires a refinement 
of the method. 


2. General weight functions. Let y(t) be a measurable function such that 
to(t) e L,(0, 1) for every » > 0, and g(t) = 0 at most on a set of measure zero. 
We set 


1 
(2.1) Miz) = fe] e(0| at, 
and 
we init Se 


Here 0 S y S ~. We assume y < © in the following. Then to any e« > 0 
there exists a § = &(e) such that 


(2.31) M(x) < 27+ for x > é, 
(2.32) M(x) > x7‘ for infinitely many x = z,— + ~. 


We can characterize y by other properties as well, which is shown by the following 
lemmas. 
Lemma 2.1. The exponent of convergence of the series 


> [M(n2»)]*, Xo > 0, 


equals 1/y, independently of 2. 
Proor. The convergence of the series for ¢ > 1/y follows from (2.31). 
Conversely, if the series is convergent, 


nile M (na) -> 0, 


since the terms are steadily decreasing. This implies that M(x) = o(2~”) 
as z—> «©,since M(x) ismonotone. Hence o = 1/y by (2.32). 
Lemma 2.2. Putting 


v0) = J | ole) | du, 


we have 
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Proor. We have 


w< f° (F) lee | au = ofr(* Yh = of — 04 


Conversely, integrating by parts, 


[ ul plu) | de 


I 


—[uzy(u)}} + | uz W(u) du 


y(t) + a uz—V(u) du 


— M(x) ast 0. 
Hence, for x > 0, #¥(t) ~ 0 ast—0, and 


M(z) = rf uzyY(u) du = ofs [ uz-"(1 — wna} = O(2-7+*), 


These estimates prove the lemma. 
Let us write S(y(t), 8, \) for the system of functions 


g(t)ks(t**), n = 1,2,3,--- 


where kg(w) is defined by (1.2). We shall prove a generalization of that part 
of Theorem 1 which refers to completeness in 1,(0, 1). Our assumptions on 
y(t) are adapted to this case, but the reader who is interested in completeness in 
L,(0, 1) or in C[0, 1] will have no difficulties in modifying these assumptions so as 
to suit his needs. 

THEOREM 2. Let y(t) satisfy the assumptions stated above. Let R(A,) 2 6 > 0, 
YT RA/An) = +o. Then S(y(t), B, ) is complete in Li(0, 1) provided 
y>B+1. 

Proor. We follow the procedure employed in the proof of Theorem 1. It is 
necessary as well as sufficient to show that the only bounded measurable function 
which is orthogonal to all functions of the system S, is equivalent to zero. We 
form 


1 
4) F(z) = i ka(t*)o(t)g(t) at, R(z) > 0, 
0 
with an arbitrary bounded measurable function g(é). If | g(t) | S g, then 
(2) | F(z) | <9 } m M(mx) Ss gM .a~7** ps mi-1+¢ 
’ i 


fre >0,2> 5(€), and a suitably chosen constant M,. The series converges 
for a sufficiently small ¢ since y > 6 + 1. It follows that the functions of S 
belong to L,(0, 1). Further, F(z) is holomorphic in the right half plane and 
bounded for x >5> 0. If the system is not complete, there exists a g(t) such 
that the corresponding function F(z) vanishes for z = \,,n = 1, 2,3,---. By 
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cause we have now to resort to the general form of the inversion formula of 
Mobius, and the restrictions on the parameters have to be made more severe 
than in the special case. In §4 we take one further step by replacing the gen- 
erating power series by an irregular power series whose exponents form a multi- 
plicative system. On the other hand, the mere replacement of the exponents 
{m}\ and the coefficients {m*} by other multiplicative systems does not call for 
any other tools than the special inversion formula. In §5 we discuss the com- 
pleteness in C[0, 1] of the system 1 + S(@ + 1, B, \) which requires a refinement 
of the method. 


2. General weight functions. Let y(t) be a measurable function such that 
to(t) « L,(0, 1) for every » > 0, and g(t) = 0 at most on a set of measure zero. 
We set 


: : 
(2.1) Miz) = [ e\o(0 | ae 
0 
and 
—1 
(2.2) lim inf log [M(x)I" = 
2900 log x 


Here 0 S y S ©. We assume y < © in the following. Then to any « > 0 
there exists a £ = &(e) such that 


(2.31) M(x) < x-7** for x > é, 
(2.32) M(x) > «-7~* for infinitely many + = z,— + @. 


We can characterize y by other properties as well, which is shown by the following 
lemmas. 
Lemma 2.1. The exponent of convergence of the series 


> [M(na»)]°, Xo > 0, 


equals 1/y, independently of x. 
Proor. The convergence of the series for ¢ > 1/y follows from (2.31). 
Conversely, if the series is convergent, 


nl* M (na) -> 0, 


since the terms are steadily decreasing. This implies that M(x) = o(2~"”) 
as x— «,since M(x) ismonotone. Hence o = 1/y by (2.32). 
Lemma 2.2. Putting 


Vi) = i "etd Tee, 


we have 


— lim int 8 YOR 
7 = nan iat log 1 — ft’ 
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Proor. We have 


wd < f° (GZ) Lee | au = ofar(* Yh = ofa — 94 


Conversely, integrating by parts, 


[ u*| o(u) | du 


1 
—[uzy(u)]} + | uzlW(u) du 


1 
y(t) + | uty (u) du 


— M(x) ast— 0. 
Hence, for x > 0, # Y(t) — 0 ast— 0, and 


M(x) = ah uzy(u) du = of [ uz-"(1 — wredu} = O(x-7**), 


These estimates prove the lemma. 
Let us write S(y(t), 8, A) for the system of functions 


g(t)ks(t*), n = 1,2,3,--- 


where kg(w) is defined by (1.2). We shall prove a generalization of that part 
of Theorem 1 which refers to completeness in L;(0, 1). Our assumptions on 
y(t) are adapted to this case, but the reader who is interested in completeness in 
L,(0, 1) or in C[O, 1] will have no difficulties in modifying these assumptions so as 
to suit his needs. 

THEOREM 2. Let o(t) satisfy the assumptions stated above. Let R(An) 26 > 0, 
D7 R(1/An) = +o. Then S(y(t), B, d) is complete in L,(0, 1) provided 
y>B+1. 

Proor. We follow the procedure employed in the proof of Theorem 1. It is 
necessary as well as sufficient to show that the only bounded measurable function 
which is orthogonal to all functions of the system S, is equivalent to zero. We 
form 


1 
a4) Ce) = [hele edad at, RE) > 0, 
0 
with an arbitrary bounded measurable function g(t). If | g(t) | S g, then 
(2.5) | F(z) | < g >> m®M(mz) < gM,a-** }) m***, 
1 1 


lore > 0,2 > 5(€), and a suitably chosen constant M,. The series converges 
for a sufficiently small ¢ since y > 8 + 1. It follows that the functions of S 
belong to L,(0, 1). Further, F(z) is holomorphic in the right half plane and 
bounded forz >6 > 0. If the system is not complete, there exists a g(d) such 
that the corresponding function F(z) vanishes for z = An, n = 1, 2,3, --- - By 
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the extension of Blaschke’s theorem to the half plane, F(z) is identically zero, 
On the other hand 


(2.6) F@) = E meL(me), 

where 

(2.7) Lie) = | reldald at 

(2.8) | L(z)| S gM. a-7+*, a= 6(e) > 0. 


But if L(z) satisfies (2.8) then it can be expressed linearly in terms of the set of 
functions F(nz) with the aid of the inversion formula of Mébius. Let y(n) be 
the factors of Mébius. Then the series 


(2.9) p u(n)n8 F(nz) = : >» u(n)(mn)* L(mnz) 
1 1 1 
is absolutely convergent, being dominated by the series 
gM. qa rte > >» (mn)8-7+¢ ; 
1 1 


which is convergent by assumption. We can then rearrange the double series in 
(2.9) and reduce with the aid of the properties of n(n). The result is that 


(2.10) Lig = Ft 


is the only function satisfying (2.6) and (2.8). In our case F(z) = 0, so that 
L(z) = 0, and by (2.7) this implies ¢(t)g(t) ~ 0 or g(t) ~ 0 since ¢(¢) vanishes 
only on a null set. Hence we have proved the completeness of S in Ly. 

The condition y > 8 + 1 cannot be improved upon, because if y = 6 + 1, 
the set S may cease to belong to L,, so that the completeness problem becomes 
meaningless. This is shown by taking g(t) = (1 — t)*,a > —1,ie., the special 
case of Theorem 1. Here y = a + 1, and if a = 8B, none of the functions 
(1 — t)*k,(t**) is in L,(0, 1). 


3. Other generating power series. The work of our first note as well as 
that of the preceding paragraph is based upon the special inversion formula of 
Mobius. This is due to the structure of the generating function k,(w), namely 
to the fact that the coefficient of w” is precisely m*. If the coefficient is merely 
O(m®) instead, the situation changes, and the general form of the inversion 
formula must be used. It will be sufficient to consider the case p = 1. 

THEOREM 3. Let g(t) and {d,} satisfy the assumptions of §2. Let a = |, 


| dm | S Bm*®,m = 2,3,---. Let S(y(t), a, d) denote the system of functions 
(3.1) g(t) >> ant”, n = 1,2,3, °°: 
m=1 
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S(o(t), a, X) ts complete in Li(0, 1) fy > B + p(B), where p(B) is the positive root 
of the equation 

1 
(3.2) is) =1+ 5. 
For y = B + p(B), there exists a choice of v(t) and {a,,} satisfying the above con- 
ditions, such that the corresponding system S(y(t), a, ) is in L,(0, 1) without being 
complete in the space. 

Remark. We note that the equation (3.2) has one and only one root >1 for 
every fixed positive B, and that p(B) is a monotone increasing function of B 
which tends to 1 or to © according as B tends to 0 or ~. 

Proor. We proceed as above. The function F(z) is defined by 


i] 


(3.3) F(z) = > an / ; t™o(t)g(t) dt, 2x«r>0. 


1 
Formula (2.5) is still valid. Since p(B) > 1 the dominant series converges, so 
that S(o(#), a, A) C Li(0, 1), and F(z) is holomorphic in x > 0 as well as bounded 
forz 26 > 0. We conclude as above that if S is not complete in L;, we can 
find a g(t), not equivalent to zero, such that F(A,.) = 0 for every n, and conse- 
quently F(z) =0. On the other hand 


(34) Pia) F eckhend, 


where L(z) is defined by (2.7) and satisfies (2.8). 
At this juncture a new element enters into the proof. Let b,,n = 1, 2, 3, --- 
be determined by the algorithm of Mébius? 


a,b; = 1, 


D> aabnya = 0. n = 2,3,--- 


d\n 


(3.5) 


The b’s are uniquely determined and are given by the following expression 
(3.6) b, = p (—1)eteC. ...agtag? eho 


Here the d’s run through all the divisors of n which are >1 and n = d{f'dq --- ; 
C..a-.. is the combinatorial function which gives the number of arrangements 
of an aggregate consisting of a; objects of one kind, az objects of another, --- , 
order being essential between elements belonging to different classes but not 
within the class. 

Let us now form the series 


io) 2) io) 
(3.7) > bn F(nz) = DS bn DS an L (nz). 
1 n=1 m= 1 
ee 
*See A. F. Mébius, Journal fiir die reine und angewandte Mathematik, 9 (1832) 105-123, 
and Gesammelte Werke, IV, pp. 591-612. Parts of this paper are reproduced in Netto’s 


Lehrbuch der Combinatorik, 2nd ed. pp. 172-180. Formula (3.6) below is easily proved by 
complete induction. 


ee ee ee 
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The double series is absolutely convergent if 
(3.8) Le > (mn)—7*© | dm | | Bn | 
1 


converges. But (3.6) shows that 

[ba| S D> Cores--- b@a |* | ae, |%* ++ 
F Cocey OO RO... 
=n? Cea... BU, 


where the summation extends over the various factorizations of n. Hence (3.8) 
is dominated by 


8.10) FF on)? Casa BE 
n=1 m=1 


Since a > 8 + 1, the summation with respect to m can be carried out and gives 
the factor ¢(y — B — e). In the remaining summation with respect to n we first 
isolate the term corresponding to n = 1, this term can be taken to be unity 
since b; = 1 and the estimate yielded by (3.9) is vacuous. Collecting terms in 
which a; + a, + --- has a constant value, we can rewrite (3.10) as a power 
series in B, obtaining 


ty -e-o{1+ > ae Meimdet 


n=2 La;taet:--= 


(3.9) 


IA 


Here the square bracket gives the number f;(n) of factorizations of n into k 
factors each greater than 1, order being essential. It is well known‘ that 


D> faln) n-* = [5(6) — UK, ) > 1. 


Hence (3.10) is a geometric series in B whose sum reduces to 


fy —B-—»® (1 — Blt — B- ® — I)}-, 


the series being convergent for y > B + € + p(B). It follows that the double 
series in (3.7) is absolutely convergent and may be reduced with the aid of the 
algorithm (3.5). Hence 


(3.11) La) ~ Fb Pan 


is the only solution of (3.4) which satisfies condition (2.8). From this point on- 
wards the completeness proof proceeds as above. 





‘See L. Kalmar, Matematikai és Fizikai Lapok, 38 (1931) 1-15, and Acta Litterarum ac 
Scientiarum, 5 (1931) 95-107 for this “‘factorisatio numerorum”’ problem which differs from 
the divisor problem of Piltz. 
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It remains only to show that the limit found for y is the best possible, other 
things being equal. We choose y = 8 + p(B), 
(3.12) an = —Bmé, m = 2,3,.-.--. 


This choice makes all the terms positive in (3.6), i.e., it gives the b’s the largest 
values which they can have when |a,,| < Bm*. The homogeneous equation 
corresponding to (3.4) now reads 


(3.13) H(z) —B » 6 m® H(mz) = 0, 
2 

and admits of the solution 

(3.14) H(z) = 2-8 0) = z-7 


which satisfies condition (2.8). It follows that F(z) vanishes identically if 
L(z) = H(z). Since 


H(z) = x 5) * yet (tog +" dt, 


we can choose 


(3.15) eo) = a ; (toe ‘ys g(t) = 


This choice satisfies the conditions of §2. The corresponding system 
S(¢(t), a, A) C L,(0, 1), but it is not complete in this space because g(t) = 1 is 
orthogonal to every function of the system. The deficiency of the system S, i.e., 
the number of functions g(é) orthogonal to the set, g(#) « L,,(0, 1), equals the 
number of linearly independent solutions of (3.13) which satisfy (2.8). This 
number is at least one, but may be larger. 

It is of some interest to consider for a moment the two systems 


(1)—1 1 {” 
(3.16 a 
) log ; ii— 
(n = 1, 2, 3, -++) 
3.17 we te 
) log ; i 1—-t’ 


of which the former is complete in L,(0, 1) and the latter isnot. The reason for 
this striking difference in behavior between two so closely related sets of func- 
tions does not seem a priori obvious. 


4. Generating Dirichlet series. In the previous discussion we have made 
essential use of the fact that the set of functions {F(nz)} is carried into a subset of 
itself by the transformation z | mz, in other words, of the property of the natural 
tumbers of being a multiplicative system. Now the integers are brought into 
our problem by the power series defining the generating function, and it lies at 
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hand to try if the method does not extend to irregular power series whose ex- 
ponents form a multiplicative system. This extension will be given in the pres- 
ent paragraph. For the sake of convenience we work with Dirichlet series 
rather than with irregular power series which means replacing the interval 
(0, 1) by (0, ©). We consider only completeness in L,(0, ©), the extension to 
L,(0, ©) and C[0, ©] being obvious. 

Let the positive quantities e(n) form a multiplicative system, i.e., 


(4.1) e(mn) = e(m)e(n), m,n = 1,2,3,--- 


It follows that e(1) = 1. The system is clearly determined by the sub-system 
{e(p)}, where p runs through the primes. We suppose that the abscissa of con- 
vergence of the series 


(4.2) > le(p)" 
equals o0,0 S o < «©. Form the corresponding zeta function 
(4.3) ds) = TE 1 - Ler} = 3 ter, 


which is also convergent for R(s) > oo, and the adjoint exponential series 
(4.4) Es) = Dy ee), 
1 


For every positive 7, e~* = O(a) asx-> ~. Hence foro > 0 


E.(0) = oe > eenst*\ 


n=1 


The series on the right hand side converges for 7 > oo. Hence (4.4) converges 
for R(s) > 0, and 


(4.5) Ec) = O(o **") 
for every positive 7. It is frequently permitted to suppress 7, but this is of no 
importance for the following. It is clear that 


(4.6) E8(s) = > [e(n)]8 e-se™ 


is also convergent for R(s) > 0, and that 
(4.7) EY? (c) = O(o- 7 *)-1) 4 O(1). 
¢.(c) is a monotone decreasing function of o in the interval a) < « < © which 


tendstolasc— oo. Put 


lim ¢.(c) = Z. 
o—ao+0 
Here Z is either + ~ or a finite quantity >1. The latter possibility cannot be 
excluded since the set {¢(p)} can be chosen arbitrarily, subject only to a con- 
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vergence condition. Let J. denote the interval 1 < o S$ Zorl <a < ~ 
according as Z is finite or infinite. We shall prove the following 

TurorEM 4.1. Let (u) be a measurable function which is ¥ 0 except ina set of 
measure zero, such that &(u) e~™ € L,(0, ©) for every n > 0, and 


(4.8) i e™ | B(u) |du S Mex, x = 4(6) > 0, 
0 


fora fixed y > 0 and every 6>0. Leta; = land|an| < Ble(m)]® for a fixed 
real B. Let Rn) 25> 0, OOP R(1/An) = +0. Then the system S(®(u), a, €,d): 


(4.9) &(u) p Am e~>n™ , n=1,2,3,---, 
m=1 
is complete in L,(0, ~) ify >B+p. Here p = p(B) = oo, where p.(B) is the 


positive root of the equation 


(4.10) ¢(s) = 1+ Mt 


provided the right hand side of this equation belongs to J., otherwise p = oo. In the 
former case there exists a choice of (u) and of {am}, satisfying the above conditions, 
such that the corresponding system S(®(u), a, €, \) ts in Li(0, ©) without being 
complete in the space if y = B + p.(B). 

Proor. We proceed as in the previous cases. We have to form 


(4.11) F(z) = Qn . e~=ue(m) (wu) G(u) du, R(z) > O. 

The assumptions enable us to show that 

(4.12) F(z) = ofr > lmyp-vi 

The series being convergent, it follows that F(z) is holomorphic in x > 0, and, in 


particular, that S C L,(0, ©). If the system is not complete, there exists a 
function G(u) ¢ L,,(0, ©), not equivalent to zero, such that F(z) =0. Further 


(4.13) F(z) = > Am L(e(m)z), 

where ta 

(4.14) L(z) = i ert &(u) G(u) du 

(4.15) | L(z) | S @ Mea-7**, x = 6(0) > 0. 


We determine the quantities b,, from the Mobius algorithm (3.5) and form 


: b, F(e(n)z) = > bn > Gm L(e(mn)z). 
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The absolute convergence of the double series is established as in §3. We obtain 
a majorant of the form 


Cs ar? § ly ying B cial 6) {1 ini Big (y a B 2) 6) —1)}}-. 


This is an analytic function of y which has a right half plane of holomorphism eq 
Ry) > vo + 0 = oo + 4, and if yo > oo, then yo = p.(B). Hence the double 
series is absolutely convergent for y > Yo, and for such values of y, 





(4.16) L(z) = >> bn F(e(n)z) is 
n=1 of 
is the only solution of (4.13) which satisfies (4.15). In particular, if F(z) = 0 
i so is L(z), whence it follows that G(u) ~ 0 and that S is complete. (4. 
i The construction of a ‘Gegenbeispiel’ follows the same lines as in §3. We take 
(4.17) Am = —Ble(m)]?, m = 2,3,---. sel 
Then the homogeneous equation | 
A} (4.18) H(z) — B >) [e(m)]* H(e(m)z) = 0 (4 
HH m=2 
| is satisfied by Tt 
i (4.19) H(z) = 2-6) = 277, | 
ne 4, 
eee provided 1 + 1/B belongs to J, and we assume y = 6 + p.(B). Taking 
ti 1 Th 
(4.20) @(u) = ur, G(u) = 1, ple 
l'(y) Bi 
we obtain a system S(®(u), a, e, 4) C Li(0, ©), but not complete in this space 
since G(u) = 1 is orthogonal to every function of the set. If 1 + 1/B does not 
belong to J., this construction breaks down. Moreover, a glance at the proof 
shows that the completeness of the system S is preserved for y = 8 + 09. On anc 
the other hand, we cannot permit any smaller value for y, because, as is easily - 
shown by examples, S will then cease to belong to L:(0, ©), so that the com- Y 
pleteness problem becomes meaningless. ae 
Theorem 4.1 is a generalization of Theorem 3. We can find a corresponding 
generalization of Theorem 2 in which we merely replace the exponents {m} by 
another multiplicative system {¢(m)} and leave the coefficients {m*} alone. , 
The analysis will then involve the special inversion formula again. This, how- ve 
ever, is due essentially to the fact that {m*} is a multiplicative system. We are , 


thus led to a further generalization of Theorem 2 in which both the exponents , 
{m} and the coefficients {m*} are replaced by multiplicative systems, {¢(%)} y 





i and {a(m)} respectively. - 
i We make the same assumptions on {¢(m)} as above. Let a(m) be real or com- 
plex, a(1) = 1, » 


(4.21) a(mn) = a(m)a(n), m,n = 1,2,3,-°") was 
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and let the abscissa of absolute convergence of the series 


(4.22) 2. a(p) [e(p)}-* 


equal 7,0 S ™ < ©. ‘Then the series 


> a(n) emse(n) 


is absolutely convergent for #(s) > 0. Let S((u), a, «, 4) denote the system 


of functions 


(4.28) ®(u) D> alm) ene, n=1,2,3,---. 


m=1 


THEOREM 4.2. Let &(u) and {d,} satisfy the assumptions of Theorem 4.1. The 


set S(b(u), a, €, A) ts complete in L,(0, ©) provided y > 7. 
Proor. As usual we form 


(4.24) F(z) = > a(m) I ‘ e~(™eu &(u) G(u) du, R(z) > 0. 
Then 
(4.25) F(z) = Ofer y | a(m) | elmy-r¥, 


The series is convergent for sufficiently small @ > 0. If the system S is not com- 
plete, there exists a G(u) «L,(0, ©), not equivalent to zero, such that F(z) =0. 
But, with the usual notation, 


F(z) = : a(m) L(e(m)z), 


and 


¥ uln) a(n) Fle(n)2) = ” i deine Lidene 


n=] 


= Ps a(k) L(e(k)z) p> u(d) ™ L(z), 


the double series being absolutely convergent. Hence L(z) = 0, G(t) ~ 0, and 
Sis complete. 

It is clear that in Theorem 4.1 we can replace the assumption | a» | S Ble(m)]* 
by|dn| S Ba(m), where the {a(m)} form a multiplicative system, if the other 
assumptions are suitably modified. 


5. Completeness in C[0, 1]. In this paragraph we shall investigate the 
question of the completeness of the system 1 + S(8 + 1, 8, A) in C[0, 1] which 
Was raised in our first note. ° 








ee eee 
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TureoreM 5. The system 1 + S(6 + 1, B, dA) 7s complete in 
ClO, 1] af -1 <B <2, 


and 
Rn) Z6>0, DO R(I/Ad) = +2. 
1 


For the proof we need some lemmas. 

Lemma 5.1. Q,(t) = t'(1 — #4 kg(t) 18 an increasing function of t in 
0<t<lforlsf8. 

Proor. A straight forward calculation shows that the derivative of Q,(t) 
equals 


(1 OD (n+ [w+ 2% — (w+ IP (W414 He > 0 


since every square bracket is positive.® 
Lemma 5.2. For0 <é#<1,-1<652,n =1,2,3,--- 


mbt yn (n + 1)8+1 jr 


a(t, B, n) = (i ae {n)eH gt (1 om intl) b+1 








Proor. We begin with the case 8 = 2. A simple calculation shows that 
5(t, 2, n)(t — 1)? is holomorphic at t = 1 and tends to a positive limit as f — 1. 
Writing P,,(t) for the product of 5(t, 2, n) with -" (1°— #”)3(1 — t***)3 we get 


P,(t) = A — Bt + 3(B — Aji! — 3B! 4 3 APY + Bet — Ars, 
where A = n3, B = (n + 1). 

By the rule of Descartes, the equation P,(¢) = 0 has at most five positive roots. 

But P,(t) = (1 — t)*Q,(t), where Q,(1) > 0, further P,(0) > 0 and P,(é) - —* 


as t— +0. Hence there are actually five roots, but the fifth root must be 
greater than 1, so that the lemma is true for 8 = 2. Hence for —1 < B < 2 


1 1 
n (n+ 1)t® — (n+ 1)t8+1 
7s * 1 — pH ” j—-pen ”’ 





which proves the lemma.*® 
Lemma 5.3. For0 <t<1,1S5 68 S$ 2,n = 1, 2,3, --- 


A(t, B, n) = nPHkg(tr) — (n + 1)PHk g(t) > 0. 





5° See Hardy, Littlewood, and Pélya, Inequalities, p. 39, formula (2.15.1). 
° For the special cases 8 = 0 and 1 see Hardy, Littlewood, and Pélya, op. cit., p. 41, 
formula (2.15.6), and p. 99, Theorem 131, resp. . 
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Proor. Since f+! < i*, Lemma 5.1 implies 


n+! tn 


A(t, B, n) = a —ryPr t-"(1 — t)8+1 kg(t") 


(n + 1)## {ntl 
or: (1 — nye 





t(1 = {rt+1)B+ kg(t"*") 


> (1 — emH)b+ g(t) (t, B, n) > 0 


by Lemma 5.2. It is of some interest to note that A(t, 8, n) is holomorphic at 
{= 1 when B is not a negative integer, and 


a(t) A(1, B, n) = ¢(—6)[n*+! — (nm + 1)4+]. 


This is readily obtained from the known representation of kg(w) near w = 1.7 

It shows that Lemma 5.3 is not true for 4k + 2 < 6 < 4k 4+ 4,k =0, 1, 2,---, 

and we do not know if it is true for any value of 8 outside of the range 1 S 6 < 2. 
Lemma 5.4. For —1 < B S 2 the series 


> u(n)n® ke(t")(1 — t)6+! = t(1 — eH 


n=1 . 


converges boundedly® in 0 < t < 1. 
Proor. The series is obviously absolutely convergent for |t| < 1, and the 


hat sum of the series is found by applying the inversion formula of Mébius to the 
>1. power series of k(t). In order to prove the bounded convergence we apply 
summation by parts, and note that 
(5.1) | a(n) | < 1 where a(n) = >> = , 
m=1 
3 
1). the proof being elementary. Hence 
ots. 


_ op Sng(t) = i u(n)n8 ke(t”) = p ji(n)[n®+! kg(t”) — (n + 1)8+ kg(t"*)] 
+ fi(m)m5* k(t"). 


Ifl < 6 < 2, Lemma 5.3 applies and shows that all square brackets are positive. 
Hence by (5.1) Sm,a(é) is dominated by the expression obtained by replacing 
i(n) throughout by unity, i.e., by kg(t). Since (1 — t)8+'k,(t) is bounded in 
0<t< 1, the lemma is proved for the case 1 < 8 S$ 2. But we can reduce the 
remaining cases to this one. Let us first assume 0 S 6B < 1. Since 


(5.2) k(t) = i ; u- keya(tu) du, 


ee ee ae 


"Cf. E. Lindelof, Calcul des residues, p. 139. 
41, *More than this is true. We could actually prove uniform convergence, but the proof 
seems to require the prime number theorem or results of similar depth. 
*See E. Landau, Vorlesungen tiber Zahlentheorie, I, p. 21. 


. 
; 
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we have ar 


m 1 
Sm,s(t) = >> p(n)n® [ ukess(t*u) du at 


n=1 


m 1 
> uln)nPt i “epsaltor®) dr 
0 


n=1 


1 
= i ie Sm,p41(tr) dr. ant 
0 


But | Sm,eyi(t7) | S keyi(tr) since 1 S B + 1 S 2. Then (5.2) shows that gat 
Sm,a(t) is still dominated by k(t), so that the lemma is also true for 0 < 6 < 1, in] 
and the argument can evidently be repeated for the interval —1 < B < 0. 

Proor oF THEOREM 5. If the system 1 + S(6 + 1, 8, A) should not be com- 
plete in C[0, 1], there exists a function g(t) of bounded variation in 0 < ¢ < 1 
such that g(0) = g(1) and ' 


/ * kg(*)(L — 84 ag(t) = 0, n= 1,2,3, +... 
As usual we conclude that 
(5.3) F(e) = | baled — 0 dg(d) = 0. 
Form 
(5.4) L(z) = iy t(1 — t)+ dg(t), R(z) = 2>0. 
By Lemma 5.4 
(5.5) ? = x u(n)n? kp(t"), 


where the series converges boundedly after multiplication by (1 — #)**. 
Since [(1 — #)/(1 — #)]#* is a bounded function of ¢ in 0 < ¢ < 1 for a fixed 
x > 0, the bounded convergence implies that 


L(x) = lim { > a(n g(t} (1 — #)4+! dg(t) 


m—o J0 


= >> u(n)n®F(nz) = 0. 
n=1 
L(z) being analytic, we have L(z) = 0, whence we conclude that g(t) is constant 
as in our first note. 
In particular, the systems 


n (1 =e t)2¢" 
jor “hae 








B+, 


xed 


ant 
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are complete in C[0, 1]. This settles the problem raised at the end of our first 
note. It is quite likely that Theorem 5 holds over a greater range of values of 
than —1 < B S 2, but our present method seems to be limited to this interval. 


Our two notes on completeness of Lambert functions raise a number of prob- 
lems. There are applications to the Tauberian theory of various classes of 
Lambert series, and possibly also to the prime number theorem. The character 
and range of validity of the inversion formula of Mobius, its number-theoretical 
features, and its possible extension to other inversion problems invite investi- 
gation. The authors plan to return to these problems individually or jointly, 
inlater papers. . 


YALE UNIVERSITY, 
MassaAcHUSETTS INSTITUTE OF TECHNOLOGY. 
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I 


Dr. S. M. Ulam communicated the following problem to the present author. 
Suppose there are given N urns each containing the same total number of 
H white and black balls; let p,(0 < p, S$ 1, vy = 1, --- , N) denote the unknown 
i ratio of white balls to all balls in the »** urn. We assume that in each urn all 
eae values 0 < p, S 1 are equi-probable a priori, an that there is no dependence be- 


tween the distributions of white and black balls in the different urns. 

Let us assume that on drawing one ball from every urn, white appears m 
times and black n times, m + n = N. On the strength of this assumption it is 
easy to give an expression for the function P»m,,(p) representing the probability 


for the relation 


hanes (1) og tt SP 45, 





N 


In fact, since p,dp, is the infinitesimal probability for drawing a white ball from 
the vth urn, and (1 — p,)dp, for drawing a black one we obtain, by the theorem 


of Bayes, the relation 
(2) Prnn(p) = 


where 


(4) 0sp 1, 


m = Nn, or, more generally, 


; 1 
5 ao 
” ho ae 





the function Pm,.(p) will satisfy the limit relation 


0, if p< 


(6) lim Pm,n»(p) = 


N00 l,ifp> 


816 


it 


625- pect aee 


J mn(p) 
Faker 


0<p<il. 


(3) Jan(p) = J Di «++ Dm(1l — pmai) +++ (1 — pw)dp, --+ dpw, 
Ay,N 


the point set A,,7 being the intersection of the unit cube 


with the layer (1). The question arises as to whether the function Pn,»(?) 


has any distinctive properties as N tends to ©. Dr. Ulam conjectured that for 


wie tle 


‘Te 
hot coy 
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namely 
a (12) 0<h <b <1, 
(13) b3 — bibs < 0, 
(14) (b; — be)? — (1 — bi) (be — bs) < 0." 
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The interpretation would be as follows. If, for a large number of urns, on draw- 
ing one ball from each urn, white appears nearly as often as black then it is prac- 
tically certain that the average number of white balls in all urns taken together 
is nearly equal to the corresponding average of black balls. We shall see that 
this conjecture is correct. 

The problem can be generalized. We replace condition (5) by the more 
general condition 


epg ae , 


. m 
(7) wed N = t, 


(t being the “tested” relative average of white balls). Then one might expect 
the relation 


0stsl 


0, 0 Ss Pp < Po : 

(8) lim P»n»(p) = ) 

sills) 1, Po < p s 1 + 

with i 

if 

() Do = t. ! 

es 

We shall see that (8) is true but not (9). Relation (9) is to be replaced by { 
0) po = Ht + 1), | 1 


r = 


— os 
See 


in other words there does exist an asymptotic “probable” relative average po 
but its relation to the tested average ¢ is not so trivial as one might at first 
expect after treating the special cast ¢ = 3. In order to explain relation (10) 
we shall generalize our original assumption that all values p, in the »** urn are 
equi-probable a priori. We shall assume that the probability for p, to be con- | 
tained in the interval (p,, p, + dp,) is given by the “differential’’ | 


) apd, — | f° a) = 1], | | | 


We shall require some conditions concerning the moments 


bk = I p'df(p), 


ee 
= 





Formula (3) is to be replaced by the more general formula 


(5) Jnn(p) = I dij ce Rig = gead «+> 0 — pO --- GW. 


‘It can be shown that these conditions are certainly fulfilled if the probability df(p) is 
wot concentrated in the two end points p = 0, 1 or in one intermediate point exclusively. 
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Relation (8) is again true and (10) is to be replaced by 





(16) Po = pt + o(1 — 2), 
where 

_ be _ br — be 
(17) on &! re Tou 


Relation (16) is the same as 


The number }; is the expected value of p with respect to the a priori probability 
df(p). We may also claim it to be the relative average of white balls that one 
should expect a priori. As such we denote it by e¢9. The number 6 = by — }3 
is the dispersion of the a priori distribution df(p). Hence we may write 


fr) 
mi - al — eo) (t €0). 
In other words, the probable relative average po is equal to the a priori average 
€y plus a certain multiple of the excess of the tested average ¢ over the expected 
average €). Also, the factor 5/[eo(1 — eo)] lies between 0 and 1. Thus we finally 
obtain the following statement: the probable average po is a weighted mean 
between the a priori average eo and the tested average t, the values of the weights 
depending only on the expectation and dispersion of the a priori probability 


df(p). 


Po = & + 


II 


If 6(p;, --- , py) is defined and absolutely integrable over the entire Euclidean 
space of the variables p,, and D(£) is 1 in the interval 7 S ¢ S ¢ and 0 outside, 
then the integral 


may be written in the form 
1 —inan e~ ifa 


where 


E(a) = -a9 is sal » pues t °° +PN) dy, aii dpy.? 


Substituting for (p,, --- , py) the expression 


ated _ dflpx) 


(19) Pr-++ Pm(l — pms) +++ (1 — py) dpy 








? See S. Bochner, Vorlesungen tiber Fouriersche Integrale, am s. 61, Satz 16. 
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in the unit cube (4)* and 0 outside, and putting » = —Np, ¢ = Np, we obtain 
jor the function (15) the identity 
DF ies sin Np 

(20) Im nlp) = - go(a)™ Yo(a)" ———P™ da, 
T J—w a 

where 


go( a) -{ pe*? df(p), — Yo(a) - | (1 — p) et” df(p). 
Obviously 
ev(a) = bi + Fh (ia) + 8 (ia)? + --. 


vo(a) =1—h+ oe (ta) + a 





a + «-- 
and 


Jim, n(1) ai b7(1 a b;)". 


Hence introducing the functions 


AA . be mt bs 2 
ola) = 1+i; « ab, +... 
va .b; — be bo — bs , 
Male ttre ee +: 
we obtain 
(2) Pn, x(p) -! / oa)” ¥(a)r NPA da, 


Since we only know 


| o(a) | $1, | ¥(a) | $1, 


this integral may be absolutely divergent at a = +. This complication will 
be avoided if we consider the function 


rH Ginko [ ” Pa, a(p) dp. 

Ithas the value 

log 2 7 a , (sin 4 Npa)? 3 
) 2 [lade v(ayn Sn dNPe” ca, 


Using the abbreviations (17) we now put 
ela) = ec” (a), Ya) = e** yr(a). 
SED 


‘The argument loc. cit. can be easily extended so as to cover an expression of the type 
(i), for an everywhere dense set of values n, §. 


a oe 
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In the neighborhood of a = 0 we have 
(24) ed =1-At+--, teat Bee s.: 


and it follows from (13) and (14) respectively that A > 0,B > 0. Hence there 
exists an ao > O such that for all —ay < a < aoand for all m, n 


(25) | gila)™ Vila)” ace e—(Am+Bn)a" | < C . Nai e-D+Na® 
where C and D denote suitable constants. Therefore Qm,»(p) differs from 


(sin 3 Npa)? q 
Ne 


o(frece a) +0([" $8) 000 


The positive numbers 


(26) > [ exp [—(Am + Bn)a? + i(pm + on)a 


—o 


by 


_ Am + nB _ pm + on 


(27) ‘ mtn’ m+n 


depend on m, n, but owing to our assumptions (12)-(14) they lie between posi- 
tive bounds. Replacing a by a/N, (26) may be written 


* ipa (sin 4 2 
(28) 2 gt ee ie. 
T J-e a 
This differs from 
% oe | 2 
(29) : give (Sind Pa)? g, 
T J—o a 


by a term which is majorized by 


[G E o*)dat [- (1 Z cre — O(N) + ON). 


The integral (29) can easily be evaluated.t It has the value O for p < u and 
p—pforp>u. Hence, altogether, 


" O(N") ifp < 
(30) [P Pa.ao) dp = ss ites ade 
0 p—pt+O(N-), if p> w; 


the error estimation O(N-}) appearing in this formula is valid uniformly in 
O S p S 1, and uniformly in all m, n for which m + n = N. Since Pn, n(p) is 
monotonely non-decreasing, (30) allows the conclusion 


Pn n(u — g) = O(N-* 9g) 


(31) 
Pn (lu +g) = 1+ 0N+ 9"), 


ifg > 0. 





4 Loc. cit., p. 15, formula (14). 
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Assuming now the existence of the limit (7), the quantity y will converge 
towards the number po that was defined by (16). And we may conclude from 
(31) that for each « > 0 

Pm,n(po — €-) = O(N) 


Pm,n(po + €-) = 1 + O(N~4). 


The estimate (30) can be strengthened. The expression (28) is the integral 
from 0 to p of the function 


ee) Aa? A ~~? 
(33) 2 f sete 


us 20 Qa 


(32) 





which has the value 


VR Hie OS ie a VN [?* -*® 














(34) —— —_ 4x dt + O(N). 
2V/ (mr) J-p 2V (rr) J-« ‘ wiki 
Hence, in terms of the error function 
1 * «of 
(35) r) = wt 4d 
5 y(x) ay wy r 
we obtain the relation 
P — p—u) V(N/X) 
(36) I Px. a(pdp = VW(X/N) / ew W(x) dz + O(N-) 
0 —2 
or 
Pp 
e / Pn. a(p)dp = ((p — u) ¥ ((p — ») VON) 
37 


N(p—n)? 


+VJSQ/Nae © +0(N-). 


This relation can be easily seen to include (30). For p = u, (37) becomes 





(38) £ Pm, n(p)dp = V/(d/Nr) + O(N). 


0 


Ill 


In order to obtain a relation similar to (37) for the function P,,,,(p) itself we 
nake more stringent assumptions concerning the nature of the a priori distri- 
bution df(p). We require that for every ao > O, 


(39) / ode [ : | o(a)™ Y(a)* | = O(N-). 


This will be amply satisfied if the distribution df(p) arises from a density, df(p) = 
(p)dp, and the density g(p) is of bounded variation. Using again (25) we find 
that Pn. a(p) differs from (33) by O(N-'). Hence we obtain 


\0) Pn, lp) = Wp — u) VA) + O(N). 


; 
| 


a a ee 
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In particular, uniformly in all positive and negative values g, 





g 1 * -T . 
41 Pal pt =) = [ota + O(N-4). 
(a1) (1 +5) = ew [eae + o- 
We now introduce the assumption 
te = —t 


which is more stringent than (7). Obviously » = po + O(N-'), and Xd = 
ho + O(N-') where 


Ao = At + BI — 2). 


It is easily seen that (33) differs only by O(N) from the same integral in which 
d, w are replaced by Xo, po. Therefore, (40) and (41) may be replaced by 


(42) Pm, n(p) = W(p — po) (N/Xo)) + O(N-) 


1 oie 
43 Pn.n -£) = — jam | aXe dt + O(N-). 
(43) (m+ iJ a € + O(N-+) 
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1. Introduction. One of the aspects of quantum theory which has attracted 
the most general attention, is the novelty of the logical notions which it pre- 
supposes. It asserts that even a complete mathematical description of a physi- 
cal system © does not in general enable one to predict with certainty the result 
of an experiment on ©, and that in particular one can never predict with cer- 
tainty both the position and the momentum of © (Heisenberg’s Uncertainty 
Principle). It further asserts that most pairs of observations are incompatible, 
and cannot be made on © simultaneously (Principle of Non-commutativity of 
Observations). 

The object of the present paper is to discover what logical structure one may 
hope to find in physical theories which, like quantum mechanics, do not con- 
form to classical logic. Our main conclusion, based on admittedly heuristic 
arguments, is that one can reasonably expect to find a calculus of propositions 
which is formally indistinguishable from the calculus of linear subspaces with 
respect to set products, linear sums, and orthogonal complements—and resembles 
the usual calculus of propositions with respect to and, or, and not. 

In order to avoid being committed to quantum theory in its present form, we 
have first (in §§2-6) stated the heuristic arguments which suggest that such a 
calculus is the proper one in quantum mechanics, and then (in §§7—-14) recon- 
structed this calculus from the axiomatic standpoint. In both parts an attempt 
has been made to clarify the discussion by continual comparison with classical 
mechanics and its propositional calculi. The paper ends with a few tentative 
conclusions which may be drawn from the material just summarized. 


I. PaysicaL BACKGROUND 


2. Observations on physical systems. The concept of a physically observ- 
able “physical system” is present in all branches of physics, and we shall 
assume it. 

It is clear that an “observation” of a physical system © can be described 
generally as a writing down of the readings from various! compatible measure- 
ments. Thus if the measurements are denoted by the symbols m1, --- , #n, then 








‘If one prefers, one may regard a set of compatible measurements as a single composite 
“measurement”’—and also admit non-numerical readings—without interfering with subse- 
quent arguments. 

Among conspicuous observables in quantum theory are position, momentum, energy, 
and (non-numerical) symmetry. 
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an observation of S amounts to specifying numbers 2, --- , 2, corresponding 
to the different py, . 

It follows that the most general form of a prediction concerning © is that the 
point (11, --- ,2%n) determined by actually measuring yi, --- , un, will lie ing 
subset S of (21, --- , 2n)-space. Hence if we call the (71, --- , 2,)-spaces asso- 
ciated with G, its ““observation-spaces,’’ we may call the subsets of the observa- 
tion-spaces associated with any physical system ©, the “experimental propo- 
sitions’ concerning ©. 


3. Phase-spaces. There is one concept which quantum theory shares alike 
with classical mechanics and classical electrodynamics. This is the concept of a 
mathematical “phase-space.”’ 

According to this concept, any physical system © is at each instantly hypo- 
thetically associated with a “point” p in a fixed phase-space 2; this point is 
supposed to represent mathematically the “state’’ of S, and the “state” of S is 
supposed to be ascertainable by “maximal’’? observations. 

Furthermore, the point pp associated with © at a time &, together with a pre- 
scribed mathematical “law of propagation,” fix the point p, associated with © 
at any later time ¢; this assumption evidently embodies the principle of mathe- 
matical causation.* 

Thus in classical mechanics, each point of 2 corresponds to a choice of n 
position and n conjugate momentum coérdinates—and the law of propagation 
may be Newton’s inverse-square law of attraction. Hence in this case > is a 
region of ordinary 2n-dimensional space. In electrodynamics, the points of 2 
can only be specified after certain functions—such as the electromagnetic and 
electrostatic potential—are known; hence 2 is a function-space of infinitely many 
dimensions. Similarly, in quantum theory the points of Z correspond to so-called 
‘‘wave-functions,” and hence > is again a function-space—usually* assumed to 
be Hilbert space. 

In electrodynamics, the law of propagation is contained in Maxwell’s equa- 
tions, and in quantum theory, in equations due to Schrédinger. In any case, 
the law of propagation may be imagined as inducing a steady fluid motion in 
the phase-space. 

It has proved to be a fruitful observation that in many important cases of 
classical dynamics, this flow conserves volumes. It may be noted that in 
quantum mechanics, the flow conserves distances (i.e., the equations are “uni- 
tary’’). 





? L. Pauling and E. B. Wilson, “An introduction to quantum mechanics,’’ McGraw-Hill, 
1935, p. 422. Dirac, “Quantum mechanics,’’ Oxford, 1930, §4. 

* For the existence of mathematical causation, cf. also p. 65 of Heisenberg’s ‘‘The physical 
principles of the quantum theory,’’ Chicago, 1929. : 

* Cf. J. von Neumann, ‘“‘Mathematische Grundlagen der Quanten-mechanik,’’ Berlin, 1931. 
p. 18. 
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4. Propositions as subsets of phase-space. Now before a phase-space can 
become imbued with reality, its elements and subsets must be correlated in 
some way with “experimental propositions” (which are subsets of different 
observation-spaces). Moreover, this must be so done that set-theoretical inclu- 
sion (which is the analogue of logical implication) is preserved. 

There is an obvious way to do this in dynamical systems of the classical type.* 
One can measure position and its first time-derivative velocity—and hence 
momentum—explicitly, and so establish a one-one correspondence which pre- 
serves inclusion between subsets of phase-space and subsets of a suitable obser- 
vation-space. 

In the cases of the kinetic theory of gases and of electromagnetic waves no 
such simple procedure is possible, but it was imagined for a long time that 
“demons” of small enough size could by tracing the motion of each particle, or 
by a dynamometer and infinitesimal point-charges and magnets, measure quan- 
tities corresponding to every coérdinate of the phase-space involved. 

In quantum theory not even this is imagined, and the possibility of predicting 
in general the readings from measurements on a physical system © from a knowl- 
edge of its “state”’ is denied; only statistical predictions are always possible. 

This has been interpreted as a renunciation of the doctrine of pre-determina- 
tion; a thoughtful analysis shows that another and more subtle idea is involved. 
The central idea is that physical quantities are related, but are not all computable 
from a number of independent basic quantities (such as position and velocity).® 

We shall show in §12 that this situation has an exact algebraic analogue in the 
calculus of propositions. 


5. Propositional calculi in classical dynamics. Thus we see that an un- 
critical acceptance of the ideas of classical dynamics (particularly as they 
involve n-body problems) leads one to identify each subset of phase-space with 
an experimental proposition (the proposition that the system considered has 
position and momentum codérdinates satisfying certain conditions) and con- 
versely. 

This is easily seen to be unrealistic; for example, how absurd it would be to 
call an “experimental proposition,” the assertion that the angular momentum 
(in radians per second) of the earth around the sun was at a particular instant a 
rational number! 

Actually, at least in statistics, it seems best to assume that it is the Lebesgue- 
measurable subsets of a phase-space which correspond to experimental proposi- 
tions, two subsets being identified, if their difference has Lebesgue-measure 0.’ 





* Like systems idealizing the solar system or projectile motion. 

* A similar situation arises when one tries to correlate polarizations in different planes of 
electromagnetic waves. 

"Cf. J. von Neumann, “Operatorenmethoden in der klassischen Mechanik,” Annals of 
Math. 33 (1932), 595-8. The difference of two sets S,, S2 is the set (S: + S2) — Si-S: of 
those points, which belong to one of them, but not to both. 
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But in either case, the set-theoretical sum and product of any two subsets, and 
the complement of any one subset of phase-space corresponding to experimental 
propositions, has the same property. That is, by definition® 
The experimental propositions concerning any system in classical mechanics, 
correspond to a “‘field’”’ of subsets of its phase-space. More precisely: To the 
“quotient” of such a field by an ideal in it. At any rate they form a “Boolean 
Algebra.’”® | 
In the axiomatic discussion of propositional calculi which follows, it will be 
shown that this is inevitable when one is dealing with exclusively compatible 
measurements, and also that it is logically immaterial which particular field of 
sets is used. 


i 6. A propositional calculus for quantum mechanics. The question of the 
connection in quantum mechanics between subsets of observation-spaces (or 


“experimental propositions”) and subsets of the phase-space of a system G, ( 
has not been touched. The present section will be devoted to defining such a ’ 
connection, proving some facts about it, and obtaining from it heuristically by 
introducing a plausible postulate, a propositional calculus for quantum me- ¢ 
chanics. 8 
West Accordingly, let us observe that if a1, --- , an are any compatible observations t 
( mt if on a quantum-mechanical system © with phase-space 2, then’ there exists a set ti 
Mfr eee of mutually orthogonal closed linear subspaces 2; of 2 (which correspond to the 
Hs families of proper functions satisfying aif = Ainf,--- , anf = inf) such that E 
every point (or function) f e 2 can be uniquely written in the form 0 
f = cifit cofe+ csfet +++ [fie Qs] . 
Hence if we state the of 
DeriniTIon: By the “mathematical representative” of a subset S of any e] 
observation-space (determined by compatible observations a1, --+ , an) for a re 
quantum-mechanical system ©, will be meant the set of all points f of the phase- 
space of S, which are linearly determined by proper functions f;, satisfying ca 
af, = Mfz, +++, Onfe = Anfx, Where (Ai, --+ , An) €S. re 
Then it follows immediately: (1) that the mathematical representative of any pa 


experimental proposition is a closed linear subspace of Hilbert space (2) since 
all operators of quantum mechanics are Hermitian, that the mathematical 
representative of the negative! of any experimental proposition is the orthogonal 








* F. Hausdorff, ““Mengenlehre,’’ Berlin, 1927, p. 78. Ph 
*M. H. Stone, ‘‘Boolean Algebras and their application to topology,’’ Proc. Nat. Acad. 20 
(1934), p. 197. pr 
10 Cf. von Neumann, op. cit., pp. 121, 90, or Dirac, op. cit., 17. We disregard complica- | 
tions due to the possibility of a continuous spectrum. They are inessential in the pres- p.. 
ent case. lin 
11 By the ‘“‘negative’’ of an experimental proposition (or subset S of an observation- coi 
space) is meant the experimental proposition corresponding to the set-complement of S in Hil 


the same observation-space. int 
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complement of the mathematical representative of the proposition itself (3) the 
following three conditions on two experimental propositions P and Q concerning 
a given type of physical system are equivalent: 

(3a) The mathematical representative of P is a subset of the mathematical 
representative of Q. 

(3b) P implies @Q—that is, whenever one can predict P with certainty, one can 
predict Q with certainty. 

(3c) For any statistical ensemble of systems, the probability of P is at most 
the probability of Q. 

The equivalence of (3a)—(3c) leads one to regard the aggregate of the mathe- 
matical representatives of the experimental propositions concerning any physical 
system S, as representing mathematically the propositional calculus for S. 

We now introduce the 

PostuLaTE: The set-theoretical product of any two mathematical representatives 
of experimental propositions concerning a quantum-mechanical system, is itself the 
mathematical representative of an experimental proposition. 

Remarks: This postulate would clearly be implied by the not unnatural 
conjecture that all Hermitian-symmetric operators in Hilbert space (phase- 
space) correspond to observables;! it would even be implied by the conjecture 
that those operators which correspond to observables coincide with the Hermi- 
tian-symmetric elements of a suitable operator-ring M." 

Now the closed linear sum Q; + Q: of any two closed linear subspaces Q; of 
Hilbert space, is the orthogonal complement of the set-product @}-Q; of the 
orthogonal complements @; of the Q;; hence if one adds the above postulate to the 
usual postulates of quantum theory, then one can deduce that 

The set-product and closed linear sum of any two, and the orthogonal complement 
of any one closed linear subspace of Hilbert space representing mathematically an 
experimental proposition concerning a quantum-mechanical system ©, itself 
represents an experimental proposition concerning ©. 

This defines the calculus of experimental propositions concerning ©, as a 
calculus with three operations and a relation of implication, which closely 
resembles the systems defined in §5. We shall now turn to the analysis and com- 
parison of all three calculi from an axiomatic-algebraic standpoint. 


II. ALGEBRAIC ANALYSIS 


7. Implication as partial ordering. It was suggested above that in any 
physical theory involving a phase-space, the experimental propositions concern- 





"T.e., that given such an operator a, one “‘could’’ find an observable for which the 
proper states were the proper functions of a. 

*F. J. Murray and J. v. Neumann, “On rings of operators,’ Annals of Math., 37 (1936), 
p. 120. It is shown on p. 141, loc. cit. (Definition 4.2.1 and Lemma 4.2.1), that ‘the closed 
linear sets of a ring M—that is those, the “‘projection operators’ of which belong to M— 
coincide with the closed linear sets which are invariant under a certain group of rotations of 
Hilbert space. And the latter property is obviously conserved when a set- theoretical 
intersection is formed. 
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ing a system © correspond to a family of subsets of its phase-space =, in such a 
way that “x implies y” (x and y being any two experimental propositions) 
means that the subset of 2 corresponding to zx is contained set-theoretically in 
the subset corresponding to y. This hypothesis clearly is important in propor- 
tion as relationships of implication exist between experimental propositions 
corresponding to subsets of different observation-spaces. 

The present section will be devoted to corroborating this hypothesis by identi- 
fying the algebraic-axiomatic properties of logical implication with those of set- 
inclusion. 

It is customary to admit as relations of “implication,” only relations satisfy- 
ing 
S1: x implies z. 
$2: If x implies y and y implies z, then z implies z. 
$3: If x implies y and y implies z, then x and y are logically equivalent. 


In fact, 83 need not be stated as a postulate at all, but can be regarded as a 
definition of logical equivalence. Pursuing this line of thought, one can interpret 
as a ‘‘physical quality,” the set of all experimental propositions logically equiva- 
lent to a given experimental proposition." 

Now if one regards the set S, of propositions implying a given proposition z as 
a “mathematical representative” of z, then by S3 the correspondence between 
the x and the S, is one-one, and x implies y if and only if S; © S,. While con- 
versely, if L is any system of subsets X of a fixed class I’, then there is an iso- 
morphism which carries inclusion into logical implication between L and the 
system L* of propositions “x is a point of X,” X e L. 

Thus we see that the properties of logical implication are indistinguishable 
from those of set-inclusion, and that therefore it is algebraically reasonable to try 
to correlate physical qualities with subsets of phase-space. 

A system satisfying $1-S3, and in which the relation ‘“‘z implies y” is written 
xz Cy, is usually called a “partially ordered system,”’ and thus our first postu- 
late concerning propositional calculi is that the physical qualities attributable to 
any physical system form a partially ordered system. 

It does not seem excessive to require that in addition any such calculus contain 
two special propositions: the proposition [] that the system considered exists, 
and the proposition © that it does not exist. Clearly 


S4: © C xC [J for any z. 
Ois, from a logical standpoint, the “identically false” or “absurd” proposition; 


[Jis the “identically true” or “self-evident” proposition. 


8. Lattices. In any calculus of propositions, it is natural to imagine that 
there is a weakest proposition implying, and a strongest proposition implied by, 





14 Thus in 6, closed linear subspaces of Hilbert space correspond one-many to experi- 
mental propositions, but one-one to physical qualities in this sense. 
1° F. Hausdorff, “‘Grundztige der Mengenlehre,”’ Leipzig, 1914, Chap. VI, §1. 
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a a given pair of propositions. In fact, investigations of partially ordered systems 
s) from different angles all indicate that the first property which they are likely to 
in possess, is the existence of greatest lower bounds and least upper bounds to sub- 
r- sets of their elements. Accordingly, we state 

ns DeriniT1Ion: A partially ordered system L will be called a “lattice” if and 


only if to any pair z and y of its elements there correspond 


wt eer 


$5: A “meet” or “greatest lower bound” xz M y such that (5a) z N y Ca, (5b) 
cNy Cy, (5c)z Cxzandz Cyimplyz C2rN y. 
$6: A “join” or “least upper bound” z / y satisfying (6a) x U y D z, (6b) 











y- a UyDy, (6c) w Dzandw Dyimply w Dz U y. ; 
The relation between meets and joins and abstract inclusion can be sum- 
marized as follows,'® 
(8.1) In any lattice L, the following formal identities are true, ( 
l:aNla=aanda VUa=a. i 
m2 2:aNb=bNaandaUb=bUa. ia 
L3:aN (b Nc) = (afl b) Ncanda U (b Uc) = (a UD) Ue. alte 
I4:aU (afb) =afN (a U Bb) = a. te 
as Moreover, the relations a > b, a NM b = b, anda U b = aare equivalent—each 4 | i 
en implies both of the others. HE 
n- , (8.2) Conversely, in any set of elements satisfying L2—L4 (L1 is redundant), ' + 
0- afl b = banda U b = aare equivalent. And if one defines them to mean | pee 
he a > b, then one reveals L as a lattice. | 
Clearly L1—L4 are well-known formal properties of and and or in ordinary | 
le logic. This gives an algebraic reason for admitting as a postulate (if necessary) a> 
ry the statement that a given calculus of propositions is a lattice. There are other a 
reasons” which impel one to admit as a postulate the stronger statement that the | | 
en set-product of any two subsets of a phase-space which correspond to physical ¥ | 
U- qualities, itself represents a physical quality—this is, of course, the Postulate 4 | 
to of §6. 
It is worth remarking that in classical mechanics, one can easily define the 
\in meet or join of any two experimental propositions as an experimental proposi- 
ts, tion—simply by having independent observers read off the measurements which 
either proposition involves, and combining the results logically. This is true in 
quantum mechanics only exceptionally—only when all the measurements in- 
volved commute (are compatible); in general, one can only express the join or 
- ‘6 The final result was found independently by O. Ore, “The foundations of abstract 
algebra. I.,’? Annals of Math. 36 (1935), 406-37, and by H. MacNeille in his Harvard Doc- 
toral Thesis, 1935. 
at '' The first reason is that this implies no restriction on the abstract nature of a lattice— 
oy; any lattice can be realized as a system of its own subsets, in such a way that af) b is the set- 
‘ product of a and b. The second reason is that if one regards a subset S of the phase-space of 
- | asystem © as corresponding to the certainty of observing © in S, then it is natural to assume 


that the combined certainty of observing S in S and T is the certainty of observing © in 
S-T = S(\T,—and assumes quantum theory. 
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meet of two given experimental propositions as a class of logically equivalent 
experimental propositions—i.e., as a physical quality. 


9. Complemented lattices. Besides the (binary) operations of meet- and 
join-formation, there is a third (unary) operation which may be defined in par- 
tially ordered systems. This is the operation of complementation. 

In the case of lattices isomorphic with “fields’’ of sets, complementation corre- 
sponds to passage to the set-complement. In the case of closed linear sub- 
spaces of Hilbert space (or of Cartesian n-space), it corresponds to passage to the 
orthogonal complement. In either case, denoting the “complement” of an 
element a by a’, one has the formal identities, 


L71: (a’)’ = a. 
L72:aN a’ = ©anda Ua = I). 
L73: a C b implies a’ D Ob’. 


By definition, L71 and L73 amount to asserting that complementation is a 
“dual automorphism” of period two. It is an immediate corollary of this and the 
duality between the definitions (in terms of inclusion) of meet and join, that 


L74: (a N b)’ = a’ U db’ and (a U B)’ = a’ NO’ 


and another corollary that the second half of L72 is redundant. [Proof: by L71 
and the first half of L74, (a U a’) = (a” U a’) = (a’ N a)’ = ©, while under 
inversion of inclusion © evidently becomes |].] This permits one to deduce L72 
from the even weaker assumption that a C a’ impliesa = ©. Proof: for any z, 
(cx Nz’) =(e’ Ue") =e Ur D2eN ez’. 

Hence if one admits as a postulate the assertion that passage from an experi- 
mental proposition a to its complement a’ is a dual automorphism of period two, and 
aimplies a’ is absurd, one has in effect admitted L71-L74. 

This postulate is independently suggested (and L71 proved) by the fact the 
“complement” of the proposition that the readings 21, --- , 2» from a series of 
compatible observations p, --~ , un lie in a subset S of (21, --- , 2n)-Space, is by 
definition the proposition that the readings lie in the set-complement of S. 


10. The distributive identity. Up to now, we have only discussed formal 
features of logical structure which seem to be common to classical dynamics and 
the quantum theory. We now turn to the central difference between them—the 
distributive identity of the propositional calculus: 


L6:a4U (b6N c)=(Udb)N (aU chandaN (bUc) = (aN b) U (afc) 


which is a law in classical, but not in quantum mechanics. 





18 The following point should be mentioned in order to avoid misunderstanding: If a, 6 
are two physical qualities, then aU b, afb and a’ (cf. below) are physical qualities too (and 
so are © and []+). But aCbis not a physical quality; it is a relation between physi- 
cal qualities. 
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From an axiomatic viewpoint, each half of L6 implies the other. Further, 
either half of L6, taken with L72, implies L71 and L73, and to assume L6 and 
L72 amounts to assuming the usual definition of a Boolean algebra.” 

From a deeper mathematical viewpoint, L6 is the characteristic property of 
set-combination. More precisely, every “field” of sets is isomorphic with a 
Boolean algebra, and conversely.*_ This throws new light on the well-known 
fact that the propositional calculi of classical mechanics are Boolean algebras. 

It is interesting that L6 is also a logical consequence of the compatibility of the 
observables occurring in a, b, andc. That is, if observations are made by inde- 
pendent observers, and combined according to the usual rules of logic, one can 
prove L1-LA, L6, and L71-74. 

These facts suggest that the distributive law may break down in quantum 
mechanics. That it does break down is shown by the fact that if a denotes the 
experimental observation of a wave-packet y on one side of a plane in ordinary 
space, a’ correspondingly the observation of y on the other side, and b the obser- 
vation of y in a state symmetric about the plane, then (as one can readily check) : 


bN (aUa’) =bND=b>O= (Na) = Na’) 
= (bNa) U (bNa’) 


Remark: In connection with this, it is a salient fact that the generalized 
distributive law of logic: 


L6*: II (= as) =) (iI as) 
t=1 j=1 i(i) i=1 

breaks down in the quotient algebra of the field of Lebesgue measurable sets by 

the ideal of sets of Lebesgue measure 0, which is so fundamental in statistics and 


the formulation of the ergodic principle.” 


11. The modular identity. Although closed linear subspaces of Hilbert 
space and Cartesian n-space need not satisfy L6 relative to set-products and 
closed linear sums, the formal properties of these operations are not confined to 
L1-L4 and L71-L73. 

In particular, set-products and straight linear sums are known” to satisfy the 
so-called “modular identity.” 


’? R. Dedekind, ‘‘Werke,’’ Braunschweig, 1931, vol. 2, p. 110. 

* G. Birkhoff, ‘‘On the combination of subalgebras,’ Proc. Camb. Phil. Soc. 29 (1933), 
441-64, §§23-4. Also, in any lattice satisfying L6, isomorphism with respect to inclusion 
implies isomorphism with respect to complementation; this need not be true if L6 is not 
assumed, as the lattice of linear subspaces through the origin of Cartesian n-space shows. 

*1M. H. Stone, “Boolean algebras and their application to topology,’’ Proc. Nat. Acad. 20 
(1934), 197-202. 

* A detailed explanation will be omitted, for brevity; one could refer to work of G. D. 
Birkhoff, J. von Neumann, and A. Tarski. 

*8 G. Birkhoff, op. cit., §28. The proofiseasy. One first notes that since a C (aUb) fc 
ifaCc, andbf\¥cC (aU b) N cin any case, aU (bf c) C (aU b) fc. Then one notes 
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L5: Ifa Coe, thena U (bNc) = AUD) Ne. 


Therefore (since the linear sum of any two finite-dimensional linear subspaces of 
Hilbert space is itself finite-dimensional and consequently closed) set-products 
and closed linear sums of the finite dimensional subspaces of any topological 
linear space such as Cartesian n-space or Hilbert space satisfy L5, too. 

One can interpret L5 directly in various ways. First, it is evidently a re- 
stricted associative law on mixed joins and meets. It can equally well be re- 
garded as a weakened distributive law, since if a Cc, then a U (bf c) = 
(afc) U (bNc)and(a Ub) Ne = (aUb)N (aU c). And it is self-dual: 
replacing C, M, U by D, U, MN merely replaces a, b, c, by c, b, a. 

Also, speaking graphically, the assumption that a lattice L is “modular” 
(i.e., satisfies L5) is equivalent to*% saying that L contains no sublattice iso- 
morphic with the lattice graphed in fig. 1: 


(a) 
a 


Fic. 1 


(b) 


Thus in Hilbert space, one can find a counterexample to L5 of this type. 
Denote by £1, &, &, --- a basis of orthonormal vectors of the space, and by 
a, b, and c¢ respectively the closed linear subspaces generated by the vectors 
(Eo, + 10-"&; + 10-°"&,41), by the vectors &,, and by a and the vector £1. 
Then a, b, and c generate the lattice of Fig. 1. 

Finally, the modular identity can be proved to be a consequence of the assump- 
tion that there exists a numerical dimension-function d(a), with the properties 


D1: If a > b, then d(a) > d(b). 
D2: d(a) + d(b) = d(a N b) + d(a U B). 


This theorem has a converse under the restriction to lattices in which there is a 
finite upper bound to the length n of chains® © < a; < a2 <--- <a, < [of 
elements. 

Since conditions D1-D2 partially describe the formal properties of prob- 
ability, the presence of condition L5 is closely related to the existence of an 





that any vector in (a U b) f) c can be written & =a+Blaea,Beb,&ecl. Butp=iE-—@ 
is in c (since £ ec anda eaCc); henceté=at+Beau (bf\c), and aU (bNc) D (aUb) Ne, 
completing the proof. 

*4 R. Dedekind, ‘‘Werke,’’ vol. 2, p. 255. 

*s The statements of this paragraph are corollaries of Theorem 10.2 of G. Birkhoff, 
op. cit. 








LOGIC OF QUANTUM MECHANICS 833 


“a priori thermo-dynamic weight of states.” But it would be desirable to 
interpret L5 by simpler phenomenological properties of quantum physics. 


12. Relation to abstract projective geometries. We shall next investigate 
how the assumption of postulates asserting that the physical qualities attrib- 
utable to any quantum-mechanical system © are a lattice satisfying L5 and 
L71-L73 characterizes the resulting propositional calculus. This question is 
evidently purely algebraic. 

We believe that the best way to find this out is to introduce an assumption 
limiting the length of chains of elements (assumption of finite dimensions) of 
the lattice, admitting frankly that the assumption is purely heuristic. 

It is known* that any lattice of finite dimensions satisfying L5 and L72 is the 
direct product of a finite number of abstract projective geometries (in the sense 
of Veblen and Young), and a finite Boolean algebra, and conversely. 

Remark: It is a corollary that a lattice satisfying L5 and L71—L73 possesses 
independent basic elements of which any element is a union, if and only if it is a 
Boolean algebra., 

Again, such a lattice is a single projective geometry if and only if it is irre- 
ducible—that is, if and only if it contains no “neutral” elements.” 2+ ©, |] 
such that a = (a MN x) U (aN 2’) foralla. In actual quantum mechanics such 
an element would have a projection-operator, which commutes with all projec- 
tion-operators of observables, and so with all operators of observables in general. 
This would violate the requirement of “irreducibility” in quantum mechanics.” 
Hence we conclude that the propositional calculus of quantum mechanics has the 
same structure as an abstract projective geometry. 

Moreover, this conclusion has been obtained purely by analyzing internal 
properties of the calculus, in a way which involves Hilbert space only indirectly. 


13. Abstract projective geometries and skew-fields. We shall now try to 
get a fresh picture of the propositional calculus of quantum mechanics, by 
recalling the well-known two-way correspondence between abstract projective 
geometries and (not necessarily commutative) fields. 

Namely, let F be any such field, and consider the following definitions and con- 


structions: n elements 21, --- , 2, of F, not all = 0, form a right-ratio [z,:---:2,],, 
two right-ratios [1,:---:2,],, and [£1:---2&n], being called “equal,” if and only 
ifazeF with & = 2,2,1 = 1,---,n, exists. Similarly, n elements y:, --- , Yn 
of F, not all = 0, form a left-ratio [yi:---:ynl, two left-ratios [yi:--- ‘yn 
and [m:---:on): being called “equal,” if and only if a z in F with n; = zy, 
+= 1, , n, exists. 





**G. Birkhoff ‘Combinatorial relations in projective geometries,’ Annals of Math. 36 
(1935), 743-8. 

*7 QO. Gre, op. cit., p. 419. 

*8 Using the Sue AP of footnote,' and of loc. cit. there: The ring MM’ should con- 
tain no other projectidn-operators than 0, 1, or: the ring M must be a ‘“factor.’? Cf. loc. 
cit.8, p. 120. 
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Now define an n — 1-dimensional projective geometry P,_;(F) as follows: 
The “points” of P,:(F) are all right-ratios [r:---:2,],.. The “linear sub- 
spaces” of P,,:(F) are those sets of points, which are defined by systems of 
equations 


ant +--+ + Aintn = O, k=1,---,m. 


(m = 1,2, --- , the ax; are fixed, but arbitrary elements of F). The proof, that 
this 7s an abstract projective geometry, amounts simply to restating the basic 
properties of linear dependence.” 

The same considerations show, that the (n — 2-dimensional) hyperplanes in 
P,,-1(F) correspond tom = 1, notalla;= 0. Putai; = y;, then we have 


(*) Yiti + +++ + Ynt, = O, not all y; = 0. 


This proves, that the (n — 2-dimensional) hyperplanes in P,,_:(F) are in a one- 
to-one correspondence with the left-ratios [y,:---:ynlt. 

So we can identify them with the left-ratios, as points are already identical 
with the right-ratios, and (*) becomes the definition of “incitlence” (point C 


hyperplane). 
Reciprocally, any abstract n — 1-dimensional projective geometry Q,_; with 
n = 4, 5, --- belongs in this way to some (not necessarily commutative field 


F(Q,-1), and Q,_; is isomorphic with Pn_i(F(Qn_1)). 


14. Relation of abstract complementarity to involutory anti-isomorphisms in 
skew-fields. We have seen that the family of irreducible lattices satisfying L5 
and L72 is precisely the family of projective geometries, provided we exclude the 
two-dimensional case. But what about L71 and L73? In other words, for 
which P,_:(F) can one define complements possessing all the known formal 
properties of orthogonal complements? The present section will be spent in 
answering this question. 





29 Cf. §§103-105 of B. L. Van der Waerden’s ‘‘Moderne Algebra,’’ Berlin, 1931, Vol. 2. 

© n = 4,5, --- means of course n — 1 2 3, that is, that Qn_: is necessarily a ‘‘Desargue- 
sian’ geometry. (Cf. O. Veblen and J. W. Young, ‘‘Projective Geometry,’’ New York, 1910, 
Vol. 1, page 41). Then F = F(Q,_;) can be constructed in the classical way. (Cf. Veblen 
and Young, Vol. 1, pages 141-150). The proof of the isomorphism between Q,-: and the 
P,-1(F) as constructed above, amounts to this: Introducing (not necessarily commutative) 
homogeneous codrdinates x,, --- , tn from F in Qn_1, and expressing the equations of hyper- 
planes with their help. This can be done in the manner which is familiar in projective 
geometry, although most books consider the commutative (‘‘Pascalian’’) case only. D. 
Hilbert, ‘‘Grundlagen der Geometrie,”’ 7th edition, 1930, pages 96-103, considers the non- 
commutative case, but for affine geometry, and n — 1 = 2, 3 only. 

Considering the lengthy although elementary character of the complete proof, we pro- 
pose to publish it elsewhere. 

%0 R. Brauer, “A characterization of null systems in projective space,” Bull. Am. Math. 
Soc. 42 (1936), 247-54, treats the analogous question in the opposite case that X M X’ 
~ © is postulated. 
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First, we shall show that it is sufficient that F admit an involutory antiso- 
morphism W: = = W(z), that is: 


Ql. w(w(u)) = u, 
Q2. wu + v) = w(u) + wir), 
Q3. w(uv) = w(v) w(u), 


with a definite diagonal Hermitian form w(x1)y1 £1 + --- + w(tn)¥n£n, Where 


Q4. w(ti)yit1 + +++ + W(tn)¥ntn = O implies 7; = --- = x, = 0, 
the y; being fixed elements of F, satisfying w(y;) = 7;. 
Proof: Consider ennuples (not right- or left-ratios!) x:(21,---, 2), &: 
(f:, +++ , &,) of elements of F. Define for them the vector-operations 
2: (412, +++ , Ln2) (z in F), 


z+ g: (11 + fi, tet y Int fn), 
and an “inner product”’ 
(E12) = wlé)yiti + +++ + WlEn)Yntn- 


Then the following formulas are corollaries of Q1-Q4. 


IP2 (é, ru) = (E, x)u, (gu, x) = w(u)(E, 2), 
IP3 (&, 2’ + 2’) = (&, 2’) + (&, 2”), (&% + &", x) = (8, x) + (&", 2), 
IP4 (2,2) = w((z, x)) = [zl is ¥ Oif e ¥ O (that is, if any 7; ¥ 0). 


We can define z _L & (in words: “z is orthogonal to ~’’) to mean that (£, 7) = 0. 


of points of P,_:(F), which by Q4 does not contain b itself, and yet with b gener- 
ates whole projective space P,_:(F), since for any ennuple x: (2%, --- , Zn) 


a= a’ + €-[8(, 2) 


where by Q4, [¢] ¥ 0, and by IP (é, x’) = 0. This linear subspace is, therefore, 
an n-2-dimensional hyperplane. 

Hence if c is any k-dimensional element of P,1(F): one can set up inductively 
k mutually polar points b®, --- ,b inc. Then it is easy to show that the set 
c’ of points polar to every 6, --- , b®—or equivalently to every point in c— 
constitute an n-k-1-dimensional element, satisfying cN c’ = ©andec Uc’ ={). 
Moreover, by symmetry (c’)’ D c, whence by dimensional considerations 
ce’ =¢. Finally, c Dd implies c’ Cd’, and so the correspondence c — c’ defines 
an involutory dual automorphism of P,,:(F) completing the proof. 
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In the Appendix it will be shown that this condition is also necessary. Thus 
the above class of systems is exactly the class of irreducible lattices of finite 
dimensions > 3 satisfying L5 and L71-L73. 


III. Conc.Lusions 


15. Mathematical models for propositional calculi. One conclusion which 
can be drawn from the preceding algebraic considerations, is that one can 
construct many different models for a propositional calculus in quantum me- 
chanics, which cannot be differentiated by known criteria. More precisely, one 
can take any field F having an involutory anti-isomorphism satisfying Q4 (such 
fields include the real, complex, and quaternion number systems*), introduce 
suitable notions of linear dependence and complementarity, and then construct 
for every dimension-number n a model P,,(F), having all of the properties of the 
propositional calculus suggested by quantum-mechanics. 

One can also construct infinite-dimensional models P,,(F) whose elements 
consist of all closed linear subspaces of normed infinite-dimensional spaces. But 
philosophically, Hankel’s principle of the “perseverance of formal laws’’ (which 
leads one to try to preserve L5)*®? and mathematically, technical analysis of 
spectral theory in Hilbert space, lead one to prefer a continuous-dimensional 
model P.(F), which will be described by one of us in another paper. 

P.(F) is very analogous with the model furnished by the measurable subsets 
of phase-space in classical dynamics.*4 


16. The logical coherence of quantum mechanics. The above heuristic 
considerations suggest in particular that the physically significant statements in 
quantum mechanics actually constitute a sort of projective geometry, while the 
physically significant statements concerning a given system in classical dynamics 
constitute a Boolean algebra. . 

They suggest even more strongly that whereas in classical mechanics any 
propositional calculus involving more than two propositions can be decomposed 
into independent constituents (direct sums in the sense of modern algebra), 
quantum theory involves irreducible propositional calculi of unbounded com- 
plexity. This indicates that quantum mechanics has a greater logical coherence 





*! In the real case, w(x) = x; in the complex case, w(x + iy) = x — iy; in the quaternionic 
case, w(u + ix + jy + kz) = u — ix — jy — kz; in all cases, the \; are 1. Conversely, A. 
Kolmogoroff, “‘Zur Begriindung der projektiven Geometrie,’”’ Annals of Math. 33 (1932), 175-6 
has shown that any projective geometry whose k-dimensional elements havea locally 
compact topology relative to which the lattice operations are continuous, must be over the 
real, the complex, or the quaternion field. 

*2 L5 can also be preserved by the artifice of considering in P,,(F) only elements which 
either are or have complements which are of finite dimensions. 

3 J. von Neumann, “Continuous geometries,’ Proc. Nat. Acad., 22 (1936), 92-100 and 
101-109. These may be a more suitable frame for quantum theory, than Hilbert space. 

34 In quantum mechanics, dimensions but not complements are uniquely determined by 
the inclusion relation; in classical mechanics, the reverse is true! 
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than classical mechanics—a conclusion corroborated by the impossibility in 
general of measuring different quantities independently. 


17. Relation to pure logic. The models for propositional calculi which 
have been considered in the preceding sections are also interesting from the 
standpoint of pure logic. Their nature is determined by quasi-physical and 
technical reasoning, different from the introspective and philosophical considera- 
tions which have had to guide logicians hitherto. Hence it is interesting to com- 
pare the modifications which they introduce into Boolean algebra, with those 
which logicians on “intuitionist”’ and related grounds have tried introducing. 

The main difference seems to be that whereas logicians have usually assumed 
that properties L71—L73 of negation were the ones least able to withstand a 
critical analysis, the study of mechanics points to the distributive identities L6 as 
the weakest link in the algebra of logic. Cf. the last two paragraphs of §10. 

Our conclusion agrees perhaps more with those critiques of logic, which find 
most objectionable the assumption that a’ U b = [] implies a C b (or dually, 
the assumption that a NM b’ = © implies b D a—the assumption that to deduce 
an absurdity from the conjunction of a and not b, justifies one in inferring that 
a implies 6). 


18. Suggested questions. The same heuristic reasoning suggests the follow- 
ing as fruitful questions. 

What experimental meaning can one attach to the meet and join of two given 
experimental propositions? 

What simple and plausible physical motivation is there for condition L5? 


APPENDIX 


1. Consider a projective geometry Q,_1 as described in §13. F is a (not neces- 
sarily commutative, but associative) field, n = 4, 5,--- , Qu-1 = Pr-i(F) the 


Q,-1. The (n — 2-dimensional) hyperplanes are represented by the left-ratios 
[yi:-++ Yn, incidence of a point [71:--- zn], and of a hyperplane [y::-- -? yn: 
being defined by 


(1) > Y= 0 
i=1 
All linear subspaces of Q,.: form the lattice L, with the elements a, }, c, - - 
Assume now that an operation a’ with the properties L71—L73 in §9 exists: 


L71 (a’)'/=a 
L72 afNa’ = Ganda Ua =f), 
L73 a Cb implies a’ > 0’. 





* It is not difficult to show, that assuming our axioms L1-5 and 7, the distributive law 
L6 is equivalent to this postulate: a’ U b = Jimplies a Cb. 
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They imply (cf. §9) 
L74 (af b)’ =a’ U b’ and (a U Bb)’ = a’ ND’. 
Observe, that the relation a C b’ is symmetric in a, b, owing to L73 and L71. 


2. If a: [x1:---:2,],is a point, then a’ is an [yi:--- ynl. So we may write: 
(2) [zi:--- 3a], — [yi +++? male, 


and define an operation which connects right- and left-ratios. We know from 
§14, that a general characterization of a’ (a any element of L) is obtained, as 
soon as we derive an algebraic characterization of the above [z:---:z,]}. 
We will now find such a characterization of [1:--- Lal, , and show, that it 
justifies the description given in §14. 

In order to do this, we will have to make a rather free use of collineations in 
Q,-1. A collineation is, by definition, a codrdinate-transformation, which 


(3) <j = D> wi Zi forj = 1, ---,n. 


i=2 

Here the w,; are fixed elements of F, and such, that (3) has an inverse. 

(4) y= Z 63; Z;, forz = 1, ---,n, 
j=1 


the 6;; being fixed elements of F, too. (8), (4) clearly mean 





lifk=1 
bu =) : 

Oifk Al 

(5) >» 6:j we; = On, > ij Dik = Six - 
j=1 t=1 
Considering (1) and (5) they imply the contravariant coérdinate-transformation 
for hyperplanes: [y:: - - -: yn]: becomes [1: - - -:Gnl:, where 
(6) yi = Dd yO, forj =1,---,%, 
i=1 

(7) yi = D> Gis, fori = 1,--+-,%. 

7=1 


(Observe, that the position of the coefficients on the left side of the variables in 
(4), (5), and on their right side in (6), (7), is essential!) 


3. We will bring about 


(8) [S12 - += 28g, = [di1: - . - 26inle forz = 1, wee yy Ny 


in 
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by choosing a suitable system of coérdinates, that is, by applying suitable col- 
lineations. We proceed by induction: Assume that (8) holds for i = 1, ---, 
n- - = bs -++ ,m), then we shall find a collineation which makes (8) true for 


i=l,- 


eave the point [61:---:8in), by Pir and the lyperplane [Sa:-++ 28s], by 
h* our assumption on (8) is: p;’ = hj fori = 1, --- ,m—1. Consider, now a 
point a: [z1:---:2,],, and the hyperplane a’: “ie Svand ‘yak. Nowas pr’=h* 
means (use (1)) 2; = 0, and p; < a’ means _ )) y;i = 0. But these two 
aa are equivalent. So we see: If i = 1,---,m — 1, then 2; = 0 and 

; = Oare equivalent. 

"Conder now pa: [Smii-++? :Smn)r Put p,: (yt:---:y*h. As Sms = Ofor 
i=1,---,m— 1, so we have y; = 0 fori = 1,---,m-—1. Furthermore, 


on ot = 0, p, ~ 0,s0 p, not S p,’. By (1) this means y- ¥ 0. 
Form the collineation (3), (4), (6), (7), with 


0, = wy = 1, Oni = Wim = Yn ly fort = m+1,---,n, 


all other 03, OF = 0. 
{One verifies immediately, that this collineation leaves the coérdinates of the 


pi: [5a2-+-28sa), 2 = iy +++, m, invariant, and similarly those of the 
pt [Sat-++t8emh, t= -+ m — 1, while it transforms those of 

"s yi: perce yh 
into [Sma ott %e bmn. 


So after this collineation (8) holds for¢ = 1, --- ,m 
Thus we may assume, by induction over m = 1, --- , n, that (8) holds for all 
i=1,---,n. This we will do. 


The ote argument now shows, that for a: [z1:---:2nj-, a’: [yi?-++?ynl, 
(9) x; = 0 is equivalent to y; = 0, fori = 1,---,n. 
4. Put a: [ai:---tapl-,a’: [yit---tynk, amd b: [&::---t&n)-, 6’: [mi--+ tanh. 
Assume first 91 = 1, 2 = 7,73 = --- = m2 = 0. Then (9) gives & # 0, so 


we can normalize £; = 1, and &3 = £ = 0. & can depend on 72 = 7 


only, 80 2 = fo(n). 

Assume further z; = 1. Then (9) gives y: ¥ 0, so we can normalize y; = 1. 
Now a S Bb’ means by (i) 1 + n%2 = 0, and b S a’ means 1 + y2f2(n) = 0. 
These two statements must, therefore, be equivalent. So if z. # 0, we may put 


= — 23", and obtain yz = — (fe(n))7 = — (fo(— 27). «If zz =.0, then 
ye = 0 by (9). Thus, z2 determines at any rate y2 (independently of 
U3, +++, 2n)iY2 = g2(%e). Permuting the 7 = 2, --- , n gives, therefore: 


There exists for each i = 2, --- , n a function ¢g;(z), such that ys = ¢;(z;). 
Or: 


(10) If a: [l:ze:---:anj-, then a’: [1:ge(r2):-- + 2¢n(%n)h- 
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Applying this to a: [l:xe:---:a,], and c: [l:ui:---:un]j, shows: As a < ¢’ 
and c < a’ are equivalent, so 
(11) >> gi(u:) x; = — 1 is equivalent to > o(zi) uw = — 1. 

t=2 i=2 
Observe, that (9) becomes: 
(12) gi(x) = Oif and only if z = 0. 

5. (11) with v3 = --- = tp = Us = +++ = Un = O Shows: go(ue)x2 = — 1 
is equivalent to go(x2)ue = — 1. If re ¥ 0, we = (—¢o2(x2))-, then the second 
equation holds, and so both do. 

Choose 22, v2 in this way, but leave x3, --- , 2n, Us, +++ Un arbitrary. Then 
(11) becomes: 

(13) >| ¢i(ui) 2; = 0 is equivalent to >> ¢: (ai) us = 0. 
i=3 i=3 
Now put 5 = --- = 2n = Us = --- = Un = 0. Then (13) becomes: 


g3(Us3)%3 + g4(us)24 = 0 is equivalent to v3(x3)us + ga(ra)us = 0, 
that is (for.x,, us ¥ 0): 
(a) xerg’ = gals) ga(us) 
(14) is equivalent to 


(b) ugug* = galas) — g(a). 


Let 14, 23 be given. Choose uz, us4so as to satisfy (b). Then (a) is true, too. 
Now (a) remains true, if we leave uz, u4 unchanged, but change 23, x4 without 
changing z32,'. So (b) remains too true under these conditions, that is, the 
value of gs(x4)—! ¢3(23) does not change. In other words: g(x4)! ¢3(xs) depends 
on xsrq1 only. That is: gs(x4)— gs(rs) = gsa(zsvq"). Put zs = 22, 2% = 4, 
then we obtain: 


(15) g3(rz) = ga(x) Ps4(z). 


This was derived for x, z ~ 0, but it will hold for x or z = 0, too, if we define 
¥34(0) = 0. (Use (12).) 

(15), with z = 1 gives ¢3(x) = ga(X)asa, where a34 = W34(1) ad 0, owing to 
(12) forz #0. Permuting thez = 2, --- , n gives, therefore: 
(16) ¢i(x) = ¢i(X) ai; where ai; # 0. 

(For 7 = j put a;; = 1.) 

Now (15) becomes 
(17) yo(zx) = g2(x)w(z) 


wW(Z) = asga(z)aros. 











e 
ls 
v ’ 


le 





LOGIC OF QUANTUM MECHANICS 841 


Put z = 1in (17), write z for z, and use (16) withj = 2: 
g(r) = Bw(z)y:, — where B, y; ¥ 0. 


8 
(1 ) (8 == g2(1), Yi= Qi2). 


6. Compare (17) forz = 1,2 = u;x =u,z=vj;andz =1,z = wu. 
Then 


(19) w(vu) = w(u)w(r) 
results (12) and (18) give 
(20) w(u) = Oif and only if u = 0. 


Now write w(z), y: for Bw(z)B, By:. Then (18), (19), (20) remain true, (18) 
is simplified in so far, as we have 8 = 1 there. So (11) becomes 


(21) DD wu) ¥2=—-1 
t=2 
(21) is equivalent to 


> w(x) yiui = —1 


t=2 
t = 2, Ue = wand all other z; = u; = 0 give: w(u)y2x = —1 is equivalent to 
w(z)yu = —1l. If x40, u = —yz* w(zx)-, then the second equation holds, 
and so the first one gives: x = —y2' w(u) = —y_'(w(—y2z'w(z)—))-._ But 
(19), (20) imply w(1) = 1, w(w-!) = w(w)-', so the above relation becomes: 
= — 2" (w(—v2w(r7"))7 = —y2'w((— 2" w(z)) >) 
= —¥2'w(w(t)(—2)) = —y¥2'w(—72)w(w(z)). 

Put herein z = 1, as w(w(1)) = w(1) = 1,80 —yz'w(—v72) = 1, w(—2) = —¥2 
results. ‘Thus the above equation becomes 
(22) w(w(x)) = 2, 
and w(—y2) = —ve gives, if we permute the7z = 2, --- ,n, 
(23) w(—vi) = —V%- 

Put u; = —y;' in (21). Then considering (22) and (19) 
(24) > 2; = 1is equivalent to >) w(x) = 1 

i=2 i=2 

obtains. Putze = 2,23 = y,t%4 = 1—2—y,%5= +: = tn =O. Then (24) 


gives w(x) + w(y) = 1 — w(l — x — y). So w(x) + w(y) depends on x + y 
only. Replacing x, y by x + y, 0 shows, that it is equal to w(x + y) + w(0) = 
w(t + y) (use 20). So we have: 


(25) w(x) + wy) = w(x + y) 
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(25), (19) and (22) give together: 
w(x) is an involutory antisomorphism of F. 


Observe, that (25) implies w(—1) = —w(1) = —1, and so (23) becomes 


(26) wyi) = Vi 
7. Consider a: [a1:---: Znj-y a’: [yit---2Ynhk. If 21 ~ 0, we may write 
a: [lizexy!:---22n27"],, and so a’: [1:w(aer{")ye:-+-2w(tetq") yak. But 


w(eiry')¥s = w(xz')w(zi)¥i = w(e1) w(z,) 75, 


and so we can write 


too. So we have 
(27) Yi = w(ri)¥s fort = 1,---,n, 


where the 7; for 7 = 2, --- , n are those from 6., and yi; = 1. And w(1) = 1, 
so (26) holds for allz = 1,--- ,. So we have the representation (27) with 7; 
obeying (26), if x; ¥ 0. 


Permutation of the z = 1, --- , n shows, that a similar relation holds if x, ¥ 0: 
(27+) ¥i= wt(zi)¥;, 
(26*) w(yi) = vi, 


wt(x) being an involutory antisomorphism of F. (wt(zx), y{ may differ from 
w(x), yi!) Instead of y1 = 1 we have now y3 = 1, but we will not use this. 


Put all z;= 1. Then a’: [y::---:yn]: can be expressed by both formulae 
(27) and (27+). As w(x),w+(zx) are both antisomorphism, so w(1) = wt(1) = 1, 
and therefore [yi:---:ynh = [yit--+-tyak = [yt:i---:vth obtains. Thus 


(VIVE = (ni): = Vi, 4F = Yivefori =1,---,n. 
Assume now x2 ~ 0 only. Then (27+) gives y; = wt(z,)y1, but as we are 
dealing with left ratios, we may as well put 


¥i= (vq) wt (a) yt = (vit) wt (2) yt 75. 
Put 6+ = yj ¥ 0, then we have: 
(27++) yi = Bt wt (2) Br y:. 


Put now 2; = #2 = 1,23 = 2, allotherz;=0. Againa’: [yi:---:Yynk can be 
expressed by both formulae (27) and (27+*), again w(1) = wt(1). Therefore 


[yrzyotysiyat- ++ tynk = [yitye:w(x)y3:0:--- 20); 
as [yity22Bt+—wt(x)Btys:0: - - - 20h 


obtains. This implies w(x) = 6+—w(z)B* for all x, and so (27**+) coincides with 
(27). 
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In other words: (27) holds for zz. ¥ 0 too. 


Permuting 7 = 2, --- , m (only ¢ = 1 has an exceptional réle in (27)), we see: 
(27) holds if z; # 0 for? = 2,---,m. For 2; ¥ 0 (27) held anyhow, and for 
somei = 1, --- ,m wemust havez; #0. Therefore: 

(27) holds for all points a: [x1:--+:2,)-. 

g. Consider now two points a: [x,:---:2,), and b: [&:---:&],. Put 
a’: [yit---tYnl, then b S a’ means, considering (1) and (27) (ef. the end of 7.): 
(28) Zz w(ai)yiki = 0. 

i=1 


a <a’ can never hold (af a’ = 0, a ¥ 0), so (28) can only hold for x; = &, 
ifallz; = 0. Thus, 


(29) >, w(x)yix; = 0 implies x) = --- = 2, = 0. 
i=1 


Summing up the last result of 6., and formulae (26), (29) and (28), we obtain: 
There exists an involutory antisomorphism w(x) of F (ef. (22), (25), (19)) and a 


definite diagonal Hermitian form >> w(zx,)viéi in F (ef. (26), (29)), such that for 
i=l] 


a: [zyi--+tap],, b: [E12---2&n],b S a’ ts defined by polarity with respect to it: 
(28) » i w(x)yik = 0. 
i=1 


This is exactly the result of §14, which is thus justified. 
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CARDINAL NUMBERS ASSOCIATED WITH A POINT IN AN 
ABSTRACT SPACE* 


By SeLBy RosBiInson 


(Received June 1, 1935) 


1. Introduction. The fundamental space of this paper, is a Fréchet V-space 
subject to the further restriction that there are no distinct points p and q such 
that q is a point of every neighborhood of p. Our understanding is that the 
Hausdorff Axiom A holds for every V-space, and that every set which contains 
a neighborhood of >: is itself a neighborhood of p. The topological spaces of 
Hausdorff! are not sufficiently general for our purposes, since Theorem 13 states 
that Axiom C holds when certain other properties are present. 

Sections 4, 5, 6, and 7 are devoted to showing that many theorems which were 
first proved under the hypothesis that the first enumerability axiom holds, are 
true more generally for spaces in which every point has a monotone complete 
family of neighborhoods. The knowledge of such theorems led to a search for 
conditions that would imply the existence of monotone complete families of 
neighborhoods. It is shown in Theorem 1 that a necessary and sufficient condi- 
tion for a point p to have such a family of neighborhoods, is the equality of two 
cardinal numbers, which for spaces satisfying the first enumerability axiom 
cannot exceed No. In Section 3 there are various inequalities involving the six 
cardinal numbers considered in this paper. 


2. Definition of the cardinals. The least cardinal which is the power of a 
complete family of neighborhoods of p is designated by x(p), the least cardinal 
which is the power of a monotone family of neighborhoods whose product is 
p by ¥’'(p), the least cardinal which is the power of any family of neighborhoods 
whose product is p by ¥(p), and the least cardinal which is the power of a family 
of neighborhoods of p whose product is not a neighborhood of p by n(p). A 
set X is said to converge to p with respect to power, if for every neighborhood V of 





* Presented to the Society in Chicago, April, 1932. 

1 A Hausdorff space is one satisfying the four axioms: A—Every point p has a neighborhood 
and every neighborhood of p contains p, B—T he product of two neighborhoods of p is a neighbor- 
hood of p, C—The interior of every set is an open set, D—Any two points have neighborhoods 
respectively which are disjoined. Cf. Hausdorff, Grundztige der Mengenlehre, first edition 
(Leipzig, 1914), p. 213. A Hausdorff space need not satisfy the First Enumerability Axiom 
of Hausdorff: x(p) < po for every point p. 

* A complete family F of neighborhoods of p is one such that any neighborhood of p contains 
asa subset a set of F. A monotone family is one such that of any two sets of the family, one 
is a subset of the other. 
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p, |(X — p)-V| > |X-C(V) |.2 Then x(p) is the least cardinal which is the 
power of a set which converges to p with respect to power, and \(p) is the power 
of the least set X such that every neighborhood of p contains a point of X — p. 
For an isolated point p, x(p) = ¥/(p) = ¥(p) = n(p) = 1, while «(p) and A(p) 
are not defined. For non-isolated points p, x(p), \(p), ¥(p), and n(p) are always 
defined. The only possible finite value of n(p) is two, and a necessary and 
sufficient condition for the Hausdorff Axiom B is that n(p) = 2 for no point p. 
If any of the other five cardinals have finite values for the non-isolated point p, 
x(p) = ¥(p), «(p) and ¥’(p) do not exist, and (p) = 2. 


3. Relations among the cardinals. The inequalities, x(p) = X(p), 
(p) = Vp) = Wp) = n(p) and x(p) = d(p), hold for all points p, if we leave 
out of consideration such of the cardinals as may fail to be defined at the point p. 

THEOREM 1. Suppose n(p) exists and does not equal two. Then the following 
conditions are equivalent, and if they hold the six cardinals exist and are equal and 
this cardinal ts regular.’ 

(1). n(p) = x(p). 

(2). There is a monotone complete family of neighborhoods of p. 

(3). There is a complete family F, | F | regular, of neighborhoods of p such that 
every subfamily of power | F |§ 2s also a complete family. 

Proor. Assume (2). The given monotone complete family G has a sub- 
family F consisting of sets W.a(a < Qirl, the least ordinal number of power 
|F |) such that W. contains W, if a < a’ < Qr\, and such that any set of the 
original family contains a set of F’. 

Let the power of F be the minimum possible for such a subfamily. Since 
Q\r| is the least ordinal of power | F |, any subfamily F’ = [W.,] of F, | F’| =| F |, 
will also be of order type Qir|. Then F’ is a complete family of neighborhoods 
of p, since for any W, there is a Wag such that a < ag < Qrl,sothat We D Wag. 
Also | F | is regular. If not, | F | is the sum of y cardinals y; where | F | exceeds 
y and exceeds each p;. Then the sequence We, is a complete family of neigh- 


borhoods of p, contrary to the assumption that | F | is the minimum power for a 
complete subfamily of G. To prove conversely that (3) implies (2), let W: be 





*| X | denotes the cardinal number of the set X, and C(V) = P — V the subset of the 
space P consisting of points not in V. 

‘x(p), x(p) and ¥(p) were introduced by P. Alexandroff and n(p) by J.v. Neumann. See 
Alexandroff, Mathematische Annalen, Vol. 92 (1924), p. 267; Alexandroff and Urysohn, 
Memoire sur les Espaces Compacts, Verhandlungen Kon. A. Amsterdam, 1928; v. Neumann, 
Transactions of the American Mathematical Society, Vol. 37, p. 4. It was found by v. 
Neumann that his work on complete linear spaces could be extended from spaces satisfying 
the first enumerability axiom to those in which n(p) = ¥(p). 

* An irregular cardinal u is one for which there exists a set of v cardinals ua such that 
“= 2ua, ue > wa for every ueandyu>v. The least cardinal » for which this is true is called 
the least regular cardinal associated with yu. 

*| F | designates the power or cardinal number of F. 
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any set of the family F of (3) and W, any set of F contained in lesa’. The 
set W, exists for each a < Qiri, since | F| is regular and therefore exceeds 
>> «<al ’ a | where F. is the subfamily consisting of all sets of F not contained in 
W.. The monotone sequence W, is a subfamily of F of power | F |, and is 
therefore a complete family of neighborhoods of p. To prove (1) implies (2), 
arrange the sets of a complete family of power x in a sequence V, of order 
type Q,. Forevery a < Q,, the set Wa = [].-<aV a. will be a neighborhood of p 
since it is the product of neighborhoods which are less than 7 in number. Then 
W.. is the desired monotone complete family. Given, conversely, a monotone 
complete family, we can select as before a sub-family F = [W.] of regular power 
and order type Qir|. Then || 2 x since F is complete, so it suffices to prove 
|F | S ny. Consider a family G of neighborhoods U of p,|G| < |F |. Since 
F is complete, each set U contains a set W.wv) of F, and 1U D WW.w). Since 
|G| < |F| and |F'| is regular, there is a y less than Qir! such that W, C 
IW.aw) CMU. Therefore| | 27. For each a let x, be a point of Wa — Was. 
Then X = [z,] converges to p with respect to power and x exists. The existence 
of y’ follows form the existence of a monotone complete family of neighborhoods 
of p, since we assumed that no point other than p is in all neighborhoods of p. 
Clearly y’ is always a regular cardinal. So likewise is 7, since from any family 
of power 7 of neighborhoods of p one can by taking products form a monotone 
family of neighborhoods whose product is not a neighborhood. 

Theorem 1 is still true if in property (3) the only subfamilies of F which are 
required to be complete are those formed by omitting from F less than | F | sets. 
It is not known whether or not there can be a family F of irregular power such 
that all subfamilies of power | F | are complete, but if so it can be shown that 7 
is the least regular cardinal associated with | F |. 

THEOREM 2. If x(p) = ¥(p), «(p) exists and does not exceed y(p), and if also 
¥(p) ts irregular x(p) does not exceed the least regular cardinal associated with Y(p). 

This theorem is suggested by theorems 1 and 2 of Alexandroff. It contradicts 
his statement that the equality or inequality of y and x is independent of the 
existence or non-existence of x. The proof is made by arranging the sets of a 
complete family of power y of neighborhoods of p in a sequence V.. of order 
type Qy. Let Wa = [[<,V,. Then no Wa is vacuous but two sets W. may 
coincide. Let a’ be any ordinal less than Q, such that W., # W. for every a 
less than a’. Let & be the order type of the a@’’s. For any point q other than p, 
there is a Vacq), therefore a Wai), Which does not contain g. If y is the least a’ 
such that W, = Waw), W, is a set Wa which does not contain g. Then 
IIV.. = p, therefore there are y sets W. and é is coterminal with Q,. Let 7 
be the least cardinal such that there is a set of ordinals ag of order type , such 
that every az is an a’ and such that for any a there is a 6 for which a < a’ < 
ag <Q,. Then» = y if y is regular, otherwise v is the least regular cardinal 
associated with y. Let xg be a point of Was — Wag,,- The set X = [z)] 
converges to p with respect to power, since any neighborhood V, contains a set 
Wa, and |X — Va| S$|X — Wa |< |X-Wa,| S| X-Val. 
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The following theorem is analogous to the equivalence of properties (1) and 
(3) of Theorem 1, \(p) playing the réle previously taken by n(p). 

TurorEM 3. If there is a complete family F of regular power such that the 
product of the sets of each subfamily of power | F | is p,|F | = x(p) = dep). If 
F | is irregular, \(p) 7s not less than the least regular cardinal associated with | F |. 

Proor. Let p be a point of accumulation but not an element of the set 
X = |z]. Let F, be the subfamily consisting of the sets of F which contain zx. 
Then | Fz | < | F| for every x Since |F| = pe \ equals | F | if | F | is 
regular and is not less than the least regular cardinal associated with | F | if | F | 
is irregular. 

THErorEM 4, If the family F of Theorem 3 exists and |F | = y(p), «(p) 
exists and is regular and equals \(p). If | F | is regular 


|F | = x(p) = ¥(p) = x(p) = Ap), 
while if | F | ts crregular x(p) = X(p) is the least regular cardinal associated with 
|F | = x(p) = ¥(p). 

Theorem 4 is an immediate consequence of Theorems 2 and 3. 

Lemma. Ifa set X converges to p with respect to power, ¥(p) is not less than | X | 
or the least regular cardinal associated with | X | according to whether | X | is 
regular or trregular. 

TuHeorEM 5. If F is the complete family of neighborhoods of p of regular power 
of Theorem 8, a necessary and sufficient condition that | F | = x(p) = ¥(p) ts that 
x(p) exists and ts regular. 

Proor. Theorem 5 follows directly from Theorem 4 and the Lemma. Con- 
sider a set X which converges to p with respect to power and a family F of power 
v of neighborhoods V of p such that IV = p. Then |X |= 2|X-C(V) | and 
‘X| > |X-C(V)| for each V. The lemma follows immediately. It was 
proved by Alexandroff and Urysohn (loc. cit.) for | X | regular. 

Properties (1) and (2) of Theorem 1 are equivalent to the property, for any 
family F of power x of neighborhoods of p there is a monotone family H of 
neighborhoods such that each set of F contains a set of H. These three proper- 
ties are analogous respectively to the three properties of Theorem 6 below. 

THEoREM 6. If ¥(p) = n(p), ¥/(p) exists and equals ¥(p). A necessary and 
sufficient condition that ¥(p) = n(p) for a non-isolated point p, is that for any family 
F of ¥(p) neighborhoods of p there is a monotone family H of neighborhoods such 
that each set of F contains a set of H. 

The theorem holds if H is replaced by a descending sequence S of order type 
Q, the product of whose sets is p. If y = 7 there is a sequence V. of order type 
2, of neighborhoods of p such that IV. = p. The existence of y’ is established 
by considering the monotone sequence of sets Wa = Ig<a Vs. The existence of 
the monotone family H is likewise established by taking products, but now this 
process is applied to the sets of the family G@’ consisting of all sets of the given 
family F and also all sets of some family G of ¥ neighborhoods of p whose product 
isp. To prove conversely that y = 7 we show that if | F | < the product of 
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the neighborhoods which constitute F is also a neighborhood. Form the family 
G’ consisting of sets of F and sets of a family G selected as before. By hypothe- 
sis there is a monotone family H whose sets are contained in all sets of G’. For 
each set V of F choose a set of H contained in V and call the resulting subfamily 
Hy. Then there is a set of H contained in the product of the sets of Hr, there- 
fore in the product of the sets of F. Otherwise, there would be a set of Hy in 
each set of H, and hence Hr would be a family of power less than y of neighbor- 
hoods of p whose product is p. 

Theorem 7 below is obviously true, while Theorem 8 is an immediate conse- 
quence of Theorems 7 and 2 and the lemma which precedes Theorem 5. 

TuHEoREM 7. If ~'(p) exists, ¥'(p) S Ap). 

THEorREM 8. If y'(p) and x(p) exist and the latter is regular, 


Ap) = K(p) = v(p) = ’'(p). 


If V(p) = x(p) and p'(p) exists, (p) = «(p) = ¥(p) = W’(p) = x(p), and this 
cardinal is regular. 

If there exist sets X of regular cardinal number which converge to p with 
respect to power, A S | X | S y, hence all such sets X are of power ) if y’ exists 
or if the conditions of Theorem 5 hold. If there are monotone families of neigh- 
borhoods of p the product of which is p, all such families which have no sub- 
families of lower power with the same properties, will be of power not greater 
than \. Hence all such families will have the same cardinal number if « exists 
and is regular. 


4. An equivalence. I proved’ for a space satisfying the first three axioms 
of Riesz and the first enumerability axiom, that the L-spaces are exactly those in 
which the fourth axiom of Riesz holds and are also exactly those in which pro- 
perty D of Hausdorff is valid. In Theorem 9, the first enumerability axiom 
(x(p) = n(p) = No for all points p) is replaced by condition x(p) = n(p) for all 
points p. Since an L-space is one in which the topology can be defined by the 
convergence of enumerable sequences, it has n(p) = (p) = x(p) = No for 
every non-isolated point p. Hence in considering spaces for which x(p) = n(p), 
it is necessary to generalize the L-space, and this is done in two ways in proper- 
ties III and III’. It is also desirable to consider besides the fourth axiom of 
Riesz, Property I, a slightly stronger property, I’. 

I. Any set Z such that E’ (the set of all limit points or points of accumulation 
of Z) contains the points p and q, has a subset X such that X’ contains one but 
not both of the points p and q. 
‘ I’. Asset E such that E’ D p + q, has subsets X and Y such that X’ contains 
p but not g and Y’ contains q but not p. 

II. Any two points have neighborhoods without common points. 

III. A set can converge with respect to power to at most one point. 





’ Bulletin of the American Mathematical Society, Vol. 37 (1931), p. 628. 
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Ill’. A set # having p as a limit point, has a subset X such that X’ = p. 

In any V-space II implies III and III’ implies I’ which implies I. If n(p) = 2 
for no point p, II implies I’. 

THEOREM 9. If x(p) = n(p) ¥ 2 for every non-isolated point p and has the 
same value for all such potnts, all the above properties are equivalent. 

Proor. Except for III’, the theorem can be stated and proved for two fixed 
points p and q. If the points p and gq of E’ are contained respectively in dis- 
joined sets V and W, p and q are respectively limit points of V-E and W.-E. 
Conversely, if there are no disjoined neighborhoods of p and q respectively, we 
shall exhibit a set X = [z.] which converges to both p and q with respect to 
power and is such that any subset of X which has one of the two points as a 
limit point has. the other also as a limit point. Let V. and W. be monotone 
complete families of neighborhoods of p and q respectively of order type Q,. 
Let x. be any point of Vz. W.. To show that II implies III’, consider the above 
family V, of neighborhoods of p and select a point z. in each Vz. Any other 
point q is not a limit point of X. For there will be neighborhuods V, and W of 
pand q which are disjoined. Then q is not a limit point of the set X-V. which 
has no point in W, and is not a limit point of X — V, which is a set of power 
less than x = A, hence q is not a limit point of X. 


5. A sufficient condition for the first enumerability axiom. For an 
L-space P satisfying axiom C of Hausdorff, a necessary and sufficient condition 
that every monotone descending sequence of closed sets be enumerable is that P 
be self-condensed. This theorem was proved by Sierpinski and generalized by 
R. G. Putnam and by myself.? Substituting any regular cardinal u for No, a 
necessary and sufficient condition that P be y-self-condensed is that every 
descending sequence of closed sets be cf power less than ». Furthermore, it is 
not necessary that P be an L-space or that axiom C hold. But if axiom C does 
not hold, one replaces sequences of closed sets by monotone sequences such that 
for two sets V and W of the sequence, V > W implies that there is a point of V 
which is not in W = W + W’. Suppose that x(p) = y¥(p) is a regular car- 
dinal and that for any other point x there are disjoined neighborhoods of p 
and z. Consider a sequence V, of order type Q, of neighborhoods of p. Let 
Wa= Ilyr<q Var. Select as follows from the sets W., a subsequence of distinct 
sets Was of order type Q, such that Wag contains a point zg not in Wag... 
let xs be any point in Was, and let W and U be disjoined neighborhoods of x 
and p respectively, and let ag41 be an ordinal such that U D Vag,, D Wag,,- 
This ordinal exists since V. is a complete family. For any limit ordinal 8, let 








*A space P is self-condensed if every non-enumerable subset X of P contains a point z 
such that | X-U | is non-enumerable for every neighborhood U of z. The point z isa 
condensation point of X, whereas if | X-U | = | X |, z would be a nuclear point of X. 

*W. Sierpinski, Fundamenta Mathematica, Vol. 2 (1921), p. 179; Putnam, Bulletin 
American Mathematical Society, Vol. 36 (1930), p. 653; Robinson, loc. cit., p. 629, and 
American Journal of Mathematics, Vol. 55, p. 433. 
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ag be an a such that Wag C Iy<agW,. But if P is u-self-condensed the powers 
of the sequence W.g, which equals x(p) and y(p), will be less than y, and the 
following theorem is proved. 

THEOREM 10. If a self-condensed space P satisfying axiom D of Hausdorff, 
has x(p) equal to Y(p) and either regular or finite for every point p, P satisfies the 
first enumerability axiom. More generally, if P is y-self-condensed and satisfies 
axiom D, x(p) < wat points p for which x(p) equals ¥(p) and ts regular. 

Obviously, a space satisfies the first enumerability axiom if it has an every- 
where dense subset which is enumerable, and x(p) equals (p) or d(p) at every 
point p. 


6. Regularity. A regular Hausdorff space is a Hausdorff space in which 
if Z is a closed set and p a point not in E£, there are disjoined open sets containing 
E and p respectively. We define a regular V-space as one in which E ) p 
implies that there are disjoined sets to which E and p are interior. An equiva- 
lent definition is, a space in which if p is interior to V there is also a set W to 
which p is interior such that W CV. Any point p which has this property is 
valid is called a regular point, regardless of whether the space is regular. 

Our modification of the definition of regularity, is due to the fact that closed 
sets do not play the same réle in general V-spaces as in spaces satisfying axiom C, 
and it is analogous to our modification in Section 5 of the property concerning 
the power of monotone descending sequences of closed sets. However, the 
definition of regularity given above for Hausdorff spaces, can be applied to 
general V-spaces, and it yields a property which is weaker than regularity. It 
may be regarded as regularity in a new space (P, O) satisfying axiom C, the 
neighborhoods in (P, O) being only the sets which were open in the original space 
P.° It is equivalent to the property for any open set V and point p of V, there 
is an open set W and a closed set Z such that V > Z DW Dp. Anumberof 
non-equivalent definitions of regularity could be secured by making V or W or 
both neighborhoods instead of open sets, and either using the closed set Z or the 
condition W C V. 

A locally compact regular Hausdorff space satisfies the first enumerability 
axiom. From that and another theorem of Alexandroff and Urysohn (loc. cit., 
p. 66) it follows that, given a locally compact Hausdorff space P in which 
¥(p) < No for all points, a necessary and sufficient condition that P is regular is 
that it satisfy the first enumerability axiom. The three following theorems 
generalize these results and make use of new properties C¥’ and C,,. 

A family F of sets is said to be a proper covering of the set V if every point of V 
is interior to some set of F, a simple covering if every point of V is contained in 
some set of F. The family F is reducible to a proper or a simple covering of V 
of power », if there is subfamily of F of power v which constitutes a proper or a 





10 E. W. Chittenden, Transactions of the American Mathematical Society, Vol. 31 (1929), 
p. 296. 
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simple covering of V. Let u(p) and »(p) be functions of the point p whose 
values are cardinal numbers. If p has a neighborhood V such that every proper 
covering of P of power y(p) is reducible to a proper (simple) covering of V of 
power less than »(p), pis said to have property C*’ (C,,). A space has property 
(* (C,,) if each point p has property C’ (C,,), the cardinals u and »y being in 
general different for different points. 

THEeorEM 11. If n(p) # 2, p has property Cx", and for every other point q 
there are disjoined neighborhoods of p and q; p is a regular point. A space P which 
has property Cx" and properties B and D of Hausdorff, is regular. 

Proor. Let the sets V. constitute a complete family of neighborhoods of q¢ 
of power x(p). For each point q of V-E, let V. and W, be disjoined neighbor- 
hoods of p and q respectively. For each a, form a new set W, by lumping 
together all sets W, which contain no points of V.. These sets W. together 
with C(Z) and C(V) form a family F of power x(p) which properly covers P, 
hence has a subfamily F’ of power less than n(p) (consisting of C(Z) and certain 
sets W. which we designate by W.s) which properly covers V. The product 
V.1I Vag is the desired neighborhood of p, while the desired set to which E is 
interior is C(V) + 2 Wag. 

THEOREM 12. If ~’(p) exists, n(p) ¥ 2, and the regular point p has property 
Cyy, then x(p) = ¥'(p). If a regular space P has properties B and Cy-y, and 
v'(p) exists for all points p, then x(p) = W'(p) at all points p. 

Proor. Let W, be a monotone descending sequence of neighborhoods of p 
of order type Qy, II W. = p. Since p is a regular point and n(p) ~ 2, we may 
assume that Wai: C Wa CV. Toshow that the sets W, constitute a complete 
family, we prove that an arbitrary neighborhood U of p contains a set W,. 
The family F of power y’(p) composed of U and the sets C(W.) properly covers 
P, hence has a subfamily F’ of lower power which simply covers V. Then F’ 
simply covers P, since C(V) C C(W.) for every a. As |F’| < y'(p) and 
V'(p) is regular, there is a fixed ordinal y such that a < y < Qy for every a such 
that C(W.) is a set of F’. Therefore the set U + C(W,) contains all points of 
P,and U contains W,. This theorem is clearly true if one substitutes | F | for 
¥'(p), where | F'| is any regular cardinal for which there exists a monotone 
descending sequence of order type Q, » ; of neighborhoods of p whose product is p. 
Furthermore, the hypothesis n(p) # 2 could be omitted if one assumed that 
V=P. Finally, the first corollary below states a stronger result which holds 
only when there is reducibility to finite coverings. The other corollaries follow 
immediately from Theorems 11 and 12. The three corollaries also hold for 
single points. 

Corottary. In a space which has properties B and D of Hausdorff and 
property CyS&o, x(p) = ¥(p) for each point p. 

Corottary. If a space has properties B and D and Cx" and y'(p) exists for 
each point p, x(p) = W'(p) for each point p. 

Corottary. If a space P has properties C’¥ and B and ¥(p) = n(p) at each 
point, a necessary and sufficient condition that P be regular, is that property D hold 
and x(p) = ¥(p) at each point p. 
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7. Axiom C. In a previous paper we proved that a locally bicompact space 
which has properties B and D also has property C." We stated that the theo- 
rems of that paper hold, if one replaces the hypothesis of reducibility of coverings 
of all infinite powers by the first enumerability axiom and the reducibility of 
enumerable coverings. The latter theorems can be generalized as shown by the 
two examples which follow. The proofs are like that of Theorem 11. 

THEOREM 13. If a space P has property D and each non-isolated point p has a 
neighborhood V such that every proper covering of P of power x(p) ts reducible to a 
proper covering of V of power less than n(p), P is a regular Hausdorff space. 








THeoreM 14. A set E is closed, if for each point p of C(E) every proper covering 
of E of power x(p) is reducible to a simple covering of power less than (p). . 
Tue COLLEGE OF THE CiTy OF NEw YorK. 0 
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11 Bulletin of the American Mathematical Society, Vol. 39 (1933), p. 599. i 
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ELEMENTARY PROOF OF THE SPECTRAL THEOREM 


By B. A. LENGYEL AnD M. H. Stone 


(Received December 11, 1935) 


1. Introduction. Proofs of the fundamental theorem on the spectral resolu- 
tion of self-adjoint operators in Hilbert space have been given by Carleman [1], 
Hellinger [2], Hilbert [3], Koopman and Doob [4], von Neumann [5], Riesz [6], 
Stone [7], Wecken [8], and Wintner [9], using a great variety of methods. Some 
of these methods apply only to bounded operators, while others are suited to the 
general case. The resolution for bounded operators yields the resolution for non- 
bounded operators as indicated by von Neumann [5], Riesz [6], and Riesz and 
Lorch [10]. Every proof of the fundamental theorem so far devised makes use 
of some part of the theory of functions. Thus Carleman, Hilbert, Stone, and 
Wintner employ the selection principle of Helly to pass by approximation from 
the familiar case of finite Hermitean matrices to the general case. Carleman, 
Hellinger, Koopman and Doob, and Stone use facts about the representation of 
analytic functions by Stieltjes integrals. The procedures of von Neumann and 
Riesz repose upon the Weierstrass approximation theorem. Finally, the proof 
of Wecken involves the theory of the power series representation of the function 
v(1 — 2). 

We shall now give a new proof of the spectral theorem which is strictly ele- 
mentary in the sense that it depends only upon intrinsic properties of Hilbert 
space and of the real number system. Our method may be regarded as a varia- 
tion on the extremal principles devised by Schmidt [11] in the theory of integral 
equations and applicable to the case of totally continuous self-adjoint operators 
in Hilbert space. It is well known that these extremal principles do not suffice 
to resolve even bounded self-adjoint operators. Hence it is not to be expected 
that our method will exhibit a very close technical relation to Schmidt’s. 

In the present paper we shall be concerned primarily with bounded self-adjoint 
operators. Indeed our method, which requires the study of the powers of an 
operator, is not suited to the case of non-bounded operators. To discuss that 
case we must appeal to other devices. In a later paper, one of us (Stone) pro- 
poses to give a new treatment of several branches of operator theory starting 
from the results given explicitly in these pages. In particular, the_entire spectral 
theory will be developed on this basis. For the purposes of this projected paper, 
the Stieltjes integral representation of a bounded self-adjoint operator may be 
regarded as a digression, since*all the information which it provides may be 
deduced equally well from the approximation theorem given in the last section 
below. 

Since the dimensionality postulates occurring in the definition of Hilbert space 
are actually so little used in the theory, it is convenient to suppress them. Thus 
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we shall consider those spaces which are called, following Léwig [12], complex 
Euclidean spaces. 


2. Complex Euclidean Space. A complex Euclidean space & is defined as a 
class of abstract elements subject to the postulates A, B, and E of von Neumann 
[5, 7]; that is to say, it is a complex vector space or module which possesses a 
metric derived from a complex-valued Hermitian symmetric bilinear form or 
inner product and which is complete with respect to metric convergence. If 
has a finite basis with respect to linear combination, it is said to be a unitary 
space [7]; and, if % has no such basis but is separable, it is said to be a Hilbert 
space [5, 7]. The geometry of % is essentially independent of such restrictions 
on its dimensionality; the cardinal number of a complete orthonormal set in { 
may, however, exceed N). The elementary parts of operator-theory are likewise 
essentially independent of dimensionality restrictions. In this connection, it is 
important to note that the Fréchet-Riesz theorem in Hilbert space, according to 
which a continuous linear functional L(f) necessarily has the form L(f) = (f, 9), 
has been extended to complex Euclidean space by Léwig [12] and Riesz [13]. 

We shall find use for two sorts of convergence in a space {—namely, strong 
convergence and weak convergence. A sequence {f,} is said to converge strongly 
to the limit f if | f — f, | tends to zero with 1/n; and is said to converge weakly to 
the limit f if (f — fn, g) = (f,9) — (fn, g) tends to zero with 1/n for every g in 
%. While the properties of these two types of convergence are generally known, 
we may recall the properties of weak convergence on which we rely in the 
sequel: (1) if {f,} converges weakly to f, then the sequence {| f, |} is bounded 
[14] and | f | < lim inf,_.. | fn |; (2) if {| f, |} is bounded, then {f,} contains a sub- 
sequence converging weakly to a limit f; (3) if the sequence {f,,} lies in a (strongly) 
closed linear manifold J2 and converges weakly to f, then f lies in MN; (4) ina 
unitary space, weak and strong convergence are equivalent; (5) if {f,} converges 
weakly to f and {g,} strongly to g, then (f,, gn) tends to (f, g). 

We shall also require certain information concerning operator-sequences. 
The basic result reads as follows: if A,, B,, A, and B are linear operators defined 
over % and subject to the conditions 


(1) |A.f| S alf\|; (2) A,f — Af strongly; 

(3) | B.f| S$ B\f\; (4) B,f — Bf strongly ;— 
then the sequence {A,B,} has the properties 

(5) |A,Bf| S af |f |; (6) A,B,f — ABf strongly. 


A proof is easily supplied on the basis of the inequalities 
|AB.f| S a|B,f| S oB |f\, 
| ABf — A,B,f| S | A(Bf) — A,(Bf) | + | An(Bf — Buf) | 
< | A(Bf) — A,(Bf) | + a | Bf — Bf. 














ELEMENTARY PROOF OF SPECTRAL THEOREM 855 


The extension of this result by induction to the case of any finite number of 
operator-sequences is evident. 


3. The Bounds of Linear Symmetric Operators. It will be convenient to 
discuss in some detail the not unfamiliar facts concerning the bounds of linear 
symmetric operators. An operator A is said to be symmetric if its domain 
generates a closed linear manifold coincident with % and if (Af, g) = (f, Ag) for 
all f and g in its domain. Throughout this section we suppose that A is both 
symmetric and linear; but we do not require that it be continuous. For such 
an operator the inner product (Af, f) is real; and its greatest lower bound a and 
least upper bound 6 under the condition | f | = 1 are the same as the correspond- 
ing bounds of the quotient (Af, f)/(f, f) where f # 0. We admit the cases 
a= —%*,8 = +o to consideration; and we point out explicitly that when & 
consists of the null element alone the numbers a and £ are not defined. We call 
a and 8 the bounds, lower and upper respectively, of the operator A. The fol- 
lowing properties of the bounds are now evident: 

THEOREM 1. If A has bounds a and 8B, then A + I has bounds a + and 
8 + d for arbitrary real d; tf p ts real and positive, p A has bounds pa and pB; and 
—A has bounds —B and —a. 

We recall also the well-known result: 

THEOREM 2. The least upper bounds of the ‘titi (Af, g)/|f\\g| and 
|Af |/|f |, where f ¥ 0 and g ¥ 0, have max (| a|, | 8 |) as their common value 
when A has bounds a and B. 

It will be useful in the sequel to know under what conditions the bounds of an 
operator are attained. We have: 

THEOREM 3. In order that a bound of A be attained, it is necessary and suffi- 
cient that it be a characteristic value of A; and the elements for which the bound is 
attained are precisely the corresponding characteristic elements of A. 

In proving this theorem, we first note that an infinite bound is never attained. 
Thus Theorem 1 permits us to reduce the consideration of an attained bound to 
the case of an operator A with attained lower bound a = 0. We thus suppose 
that for all f in the domain of A we have (Af, f) = 0, while for some g in that 
domain we have (Ag, g) = 0,|g| = 1. Iftis an arbitrary complex number and 
han arbitrary element of the domain of A, we put f = g + lh in the inequality 
above and obtain the new inequality 


(Ah, g) + U(Ag, h) + [UP | AP 20. 
We now restrict | to have the form | = (Ag, h), where d is real. By virtue of 
the relations (Ah, g) = (h, Ag) = (Ag, h), our inequality becomes 
| (Ag, h) PAZ + AlAP) ZO. 


Since is arbitrary, we must have (Ag, h) = 0; and since h is arbitrary in the 
domain of A, we must even have Ag = 0. Thus g is a characteristic element of 
A corresponding to the characteristic value zero. It is trivial that such an ele- 
ment is one for which the lower bound of A is attained. 
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If E is a projection permutable with A in the sense that, when Af exists, AEf 
also exists and is equal to EAf,—then A may be considered in terms of its 
behavior in the ranges Jt and Jt of H and J-E respectively. It is a linear 
symmetric operator in each of these closed linear manifolds. It must be noted, 
of course, that the elements Ef, where f is in the domain of A, generate the 
closed linear manifold Nt: for if g is in Mt, then (Zf, g) = O implies (f, g) = 
(f, Eg) = (Ef, g) = 0 and hence g = 0. Consequently we may refer to the 
bounds of A in Nt and in YM respectively. Since the domain of A consists of all 
elements f + g where f and g are elements of the domain restricted to lie in 2 
and in Jt respectively, and since 


(Afi+9),f+9) = (4,,f/) + (4¢9,|\f+oP =IFP +197, 


we see that the bounds of A in & are obtained by taking the maximum and the 
minimum of its bounds in Nt and N respectively. 


4. Spectral Theory for Bounded Definite Self-Adjoint Operators. Our 
method of constructing the spectral theory is adapted to the case of self- 
adjoint operators—that is, the linear symmetric operators characterized by the 
identity A = A*—for which the bounds a and 8 satisfy the inequalities 


OSaZSB<4+ 0. 


Since the apparently more general case where a is restricted only to be finite is 
easily reduced by virtue of Theorem 1 to the one under consideration, we do not 
actually lose by the supposition that a 20. The details of the general case 
will be discussed in the following section. We recall that a self-adjoint operator 
with finite bounds is defined over the entire space &. 

We state the spectral theorem for the indicated case in the following terms: 

THEOREM 4. If H is a self-adjoint operator with finite upper bound and non- 
negative lower bound, then the class (A), — 2 << + ©, of all elements in ¥ 
for which the condition 


(1) |H*f| = |f| fork 21, 
or, when d > 0, the equivalent condition 


(2) lim sup | H*f|/A*¥ < +0, 
ko 


is satisfied, is a closed linear manifold; its projection operator F(X) is permutable 
with H and also with every bounded linear operator which is permutable with H; and 
in IN(A) the upper bound of H does not exceed ». In the range M(r) of I — FO) 
the lower bound of H is not less than d; and, if it is equal to d, is not attained. The 
inequality 4 < d implies the equivalent relations M(u) C Mr), F(w)F(A) = F (u); 
the inequality \ < 0 implies the equivalent relations I(x) = O, F(A) = O; and the 
inequality » = B, where B is the upper bound of H, implies the equivalent relations 
MA) = L, F(A) =I. Finally, when ¢ > 0 tends to zero, F(A + 6)f tends strongly 
to F(A)f for every f in &. 
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We shall break the proof of this theorem into several parts. 

First let us dispose of the concluding portions of the theorem, which are all 
elementary consequences of condition (1). If \ < 0, the inequality | /f| < 
\|f| of the condition (1) implies f = 0; and we therefore have M(A) = ©. 
Thus in considering the inequality 1» < \ we may suppose further that » = 0: 
and we then see that, if f lies in M(u), the inequalities | H*f | < u* |f| < | f| 
are valid for k 2 1 and imply that f lies also in M(A). If A = 8, where 8 is the 
upper bound of H, we note that | H*f| < 6 | H*-~| < x | H*-4| fork = 1 and 
for every f in % in accordance with Theorem 2; and we then easily conclude that 
MA) = L. Obviously, if | H*f| < (A + ©)* |f| fork = 1 and for every « > 0, 
the element f has the property that | H*f| < * |f| for k = 1; in other words, 
the manifold $(A) not only is included in all the manifolds M(A + «) for « > 0 
but actually coincides with their intersection. On well known grounds {7}, 
this geometrical property of the manifolds Nt(A) is equivalent to the property 
that the corresponding projections F(A) satisfy F(A + «)f — F(A)f strongly for 
all f in £ when ¢ tends to zero through positive values. It will be observed that 
the family of projections F(A) thus enjoys all the characteristic properties of a 
resolution of the identity [7]. 

We are thus left to prove the first two statements of the theorem. We show 
next that it is sufficient for us to consider the special case \ = 1. The case 
where X < 0 is trivial since I2(A) = O, F(A) = O as we have already remarked. 
The case where \ = 0 is also trivial: in this case the condition (1) reduces to Hf = 
0, so that (0) either is the characteristic manifold of H corresponding to the 
characteristic value 0 or again reduces to ©. Thus the assertions of the theorem 
concerning 92(0) and F(0) are easily verified. First, if JT is a bounded linear 
operator defined over % and permutable with H, then T* has like properties 
since T*H = T*H* = (HT)* = (TH)* = H*T* = HT*; and T and T* both 
transform 92(0) into a part of itself since Hf = 0 implies HTf = THf = 0, 
HT*f = T*Hf = 0. On well known grounds [7], this geometrical property of 
M0) and T is equivalent to the relation TF(0) = F(0)T. In particular, we 
may take T = H. Secondly, the lower bound of H in (0), the orthogonal 
complement of (0), cannot be less than the lower bound of H in % and thus 
cannot be less than zero; and Theorem 3 shows that if this bound is equal to zero 
it cannot be attained. Having disposed in this manner of the cases \ < 0, we 
turn to the case \ > 0. Here we see that the first two statements of Theorem 4 
are unchanged in meaning and content if we write H/d for H and everywhere 
else put \ = 1: for permutability with H/) is equivalent to permutability with H, 
and the bounds of H/) are related to those of H by Theorem 1. Since the corre- 
spondence H/ <> H does not alter the class of operator under consideration, we 
are thus enabled to confine our attention, when \ > 0, to the single special case 
h=1, 

We are therefore left to prove the first two statements of Theorem 4 with 
\ = 1. We now separate these statements and prove them consecutively as 
Theorem 4a and Theorem 4b. 
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THrEoreM 4a. If H has the properties assumed above, then the class M of all 
elements in % for which the equivalent conditions 


(1) |H*f| = |f\|fork 21, 
(2) lim sup | H*f| < +o, 


are satisfied is a closed linear manifold; its projection operator F is permutable with 
H and also with every bounded linear operator which is permutable with H; and in 
IM the upper bound of H does not exceed 1. In the range NR of I — F the relation 
lim supz—eo | H*f | = + © holds whenever f ¥ 0. 

If f is any element of %, then o(f) = lim supx... | H*f | exists and satisfies the 
inequalities 0 < o(f) S$ +. Itis also evident that o(0) = 0, o(af) = |a|o(f) 
when a ~ 0, and o(f + g) S o(f) + o(g). Hence the inequality o(f) < + 
specifies a linear manifold 2%. If 7’ is a bounded linear operator defined over { 
and permutable with H, the relations | Tf| < y |f|, | H*Tf | = | TH*f| < 
y | H*f | show that o(Tf) S yo(f). It follows that every such operator T' trans- 
forms 9t into a part of itself. 

We now observe the relations 


0 < |)? = (Af, H*f) = (H*-Y, H*+f) < | Hf | | HY |, 


holding for k = 1 and for every f in &. They show at once that, if | H?f| = 0 
for some p 2 2, then also 


|Hf|=|A¢|=.-- =|Af| =0. 
Consequently we can throw the above inequalities into the form 
| Hf |/|H*"f| = | AY |/| A'S, k 21, 


whenever Hf ~ 0. They then show that the sequence { 7:(f)}, where 7:(f) = 
| H*f |/| H*"f |, increases monotonely to a positive limit r(f). When Hf = 0 
we define 7(f) = 0. 

In order to establish the equivalence of conditions (1) and (2) we next investi- 
gate the relations between o(f) and 7(f). If 7(f) S 1, we have 


[AP I/If| = n(f)--- nf) S W(f) S 1, | Hf | Ss |f| 


fork 21. It follows that o(f) < 1 and hence that o(f) < +o. If7(f) > 1, 
there exists a real number p such that r(f) > p > 1 and an integer p such that 
tf) > pfork > p. Thus we have 


| H+? f|/| H?f| - To+i(f) chat tr+0(f) > p”, 
| H™+?f| > p| Hf | 
forn 2 1. Since Hf ¥ 0, it follows that o(f) = +. Thus we see that the 
inequalities 7(f) < 1, r(f) > 1 are equivalent respectively to the relations 


o(f) < +, o(f) = +; and also that o(f) < + © implies o(f) S$ 1. Further- 
more we see that o(f) < + if and only if | H*f| < |f|fork 21. 
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Since H* is a continuous operator, the class of elements specified by the in- 
equality | H*f | = | f | is necessarily a closed subset of %. Hence the class speci- 


fied by the simultaneous inequalities | H*f | < | f |, k = 1, being the intersection’ 


of closed sets, is itself a closed subset of &. We thus conclude that the linear 
manifold Mis closed. Now we have noted above that a bounded linear operator 
T permutable with H transforms 2M into a part of itself. Since 7'* is then also 
permutable with H, we see that T* likewise transforms 2 into a part of itself. 
In consequence, the projection operator F of I is permutable with 7. In par- 
ticular we may take T = H. Since by condition (1) the inequality | Hf | < | f | 
holds throughout 9, we see by Theorem 2 that the upper bound of H in I does 
not exceed 1. 

The manifold Jt described in the statement of the theorem is the orthogonal 
complement of 2% and thus has no element except the null element in common 
with M@. Hence, if f lies in Jt and f ¥ 0, we have o(f) = +. 

We may remark that in the proof of Theorem 4a we have not made any 
essential use of the definiteness of H. This assumption comes into play in the 
following theorem. 

THEOREM 4b. With the assumptions and notations of Theorem 4a, the lower 
bound of H in NR is not less than 1; and, if equal to 1, is not attained. In other 
words, if lim supe | H*f | = + for every f ¥ 0 in a space N, then (Hf, f) > 
(f,f) for every such f. 

Clearly we may restrict our attention to the behavior of H in 2. We know 
that lim sup... | H*f | = + for every f ~ 0in 9. We denote by a the lower 
bound of H in 9%. By our assumptions on H we know that a 20. We wish to 
prove thata 21. In reaching this result, we must proceed by cases. 

We begin with the case where 9 is a unitary space. The trivial case where Jt 
consists of the null element alone may be excluded. Then, by definition, there 
exists a sequence {f,} such that (Hfn, fn) -~ a, |fn| = 1. Since the sequence 
\|fn |} is bounded, there exists a subsequence of {f,} which converges weakly to a 
limit f. The fact that Jt is unitary implies that the convergence is strong as well 
as weak. Hence passage to the limit yields the relations (Hf, f) = a, |f| = 1. 
Since the bound a is attained, we conclude that Hf = af in accordance with 
Theorem 3. Thus | H*f| = a*|f| fork = 1so that o(f) = + implies a > 1, 
as we wished to prove. 

We proceed next to the case where Jt is a Hilbert space. If € is a positive 
number, then by definition there exists an element ¢; in Jt such that (H¢:, ¢:1) S 
a+ e,|¢:| = 1. We form a complete orthonormal set {¢n} in Jt with ¢ as its 
first element. The projection of Jt on the closed linear manifold generated by 
¥1,--+, gn will be denoted by E,. We consider the self-adjoint operator 
H, = E,HE,/(a + 2¢). Since (E,HE,f, f) = (HE, f, E,f) = 9 for all f in %, 
the lower bound of H, is not less than zero. If 8 is the upper bound of H in ¥, 
we have | E,HE,f | $ | HEnf| < B| Exf| S 8|f| and hence 


| Hnf | = (8/(@ + 2¢)) If. 
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We can therefore apply Theorem 4a to the operator H, , constructing the closed 
linear manifold I, specified by the conditions | H*f| < |f|, k = 1, the or- 
thogonal complement ¥t,, and the projection operator F,, as described in 
the theorem. If f is in the range of the projection J — E,, we have H,f = 
H,(U — E,)f = E,HE,U — E,)f/(a + 26) = 0. Thus 2M, contains the range 
of I — E,; and MN, is contained in the range of E,. Since the range of E, is a 
unitary space by definition, the manifold Yt, is a unitary space also. It follows 
that the results of the preceding paragraph are applicable to H,: in particular, 
the lower bound of H, in MN, is not less than 1. If we make use of the permut- 
ability of the projection F,, with H,, we see that 


(A, = F,)¢1, I a F,,)¢1) ame (Angi, ¢1) — (HF a¢i, Figs) S (Angi, ¢1) 


in accordance with the inequality (Hf, f) 2 0. If we recall further that 
E,¢1 = ¢1, we see that 


(E,HE,¢1, ¢1) = (HEn¢1, Engi) = (Hei, ¢1), 


(Hngi, 91) = (He, ¢1)/(a + 2e). 
Thus we have 


1— (Figi, ar) = | - Fide |? S$ (AU — Fader, I — Fads) 
(Higi, v1) = (Her, ¢1)/(a + 2¢) 


ate 
a + 2 


lA 


IA 





by virtue of the fact that (J — F,,)¢: lies in N, and of our initial determination of 
¢i. Finally we obtain the inequalities 


€ 


< (F, = |F.¢.:|? s 7=1 
aoe = Pavts o) | Frgi |’ S |¢n| 


0< 





from those just established and from the fact that F,, is a projection. Thus the 
sequence {f,}, where f, = F,¢1, contains a subsequence converging weakly to a 
limit f in Jt; and this limit cannot be the null element, since 


(f, ¢1) = limno(fn, ¢i) 2 e/(a + 2e) 


when n becomes infinite through the subscripts of this subsequence. We confine 
our attention henceforth to this subsequence, renumbering its elements and 
those of the associated operator sequences {E,}, {Fn}, and {H,} accordingly. 
Since E,,g tends strongly to g and | E,g| < |g | for every g in N, we can apply the 
closing remarks of §2 to show that H*g tends strongly to H*g/(a + 2e)* for every 
gin %. It is now easy to show that H"f, converges weakly to H*f/(a + 2¢)*: 
for we have 


(AE S.,9) = (f,, Hkg) — (f, H*g/(a + 2)*) = (H*f/(a + 26)", 9) 
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for every g in Jt. Since F,,¢; lies in M,, where the upper bound of H, is not 
greater than 1, we see that 


| His, | = | AiF.v:| S|F.¢,| $1, 721, 


and hence, on passing to the limit, that 
| H*f/(a + 2e)*| <1, E21. 


Writing this inequality in the form | H*f| < (a + 2e)* and noting that f ~ 0 
implies o(f) = +, we conclude that a + 2e > 1. Since e was an arbitrary 
positive number, we must have a 2 1, as we wished to prove. 

We can now proceed to the case where 9 is entirely unrestricted. Taking f 
as an arbitrary element in Jt, we form the closed linear manifold N; generated by 
the elements f, Hf, H*f,---. This manifold, being separable, is either a 
unitary space or a Hilbert space. Obviously H carries Mt, into a part of itself 
and is therefore permutable with the projection operator of Jt;; and we may 
therefore confine our attention to the behavior of Hin N;. Since MN, is contained 
in Jt, we have o(g) = + foreveryg ~0inM,. Thus the results established 
in the two preceding paragraphs show that the lower bound of H in 9; is not less 
than 1. In particular we have (Hf, f) = (f,f). Since f is an arbitrary element 
of %t, we must have a = 1, as we wished to prove. 

We still have to show that, when a = 1, this bound is not attained. If it 
were, Theorem 3 would show that for some f ¥ 0 the relation Hf = f would hold 
in 3. Hence we would have | H*f | = |f| for k = 1 in contradiction with the 
relation o(f) = +. The contradiction shows that the bound a = 1 is not 
attained. 


5. Spectral Theory for Bounded Self-Adjoint Operators. It is now easy 
to extend Theorem 4 to the general case of a bounded self-adjoint operator A. 
We have 

THEOREM 5. If A is a self-adjoint operator with finite bounds a and 8, there 
exists a family of projections E(\), —~ << +, with the properties 
(1) E(A) is permutable with A and also with every bounded linear operator which is 

permutable with A; 
(2) uw S dX implies E(u)E(A) = E(u); 
(3) E(A) = Oforr < a; E(A) = I ford = 8B; 
) ife > 0, then E(A + Of — E(a)f strongly as « — 0, for every f in &; 
) in the range of E(d), the upper bound of A does not exceed d; . 
) in the range of I — E(d), the lower bound of A is not less than ; and, if equal 
to d, ts not attained. 
The projection E(d) is uniquely determined by properties (1), (5), and (6). 

Since H = A — al is a self-adjoint operator with bounds 0, 8 — a in accord- 
ance with Theorem 1, we can apply to it the results of Theorem 4, constructing 
the corresponding family of projections F(A). We define E(A) = F(A — a). 
Properties (2), (3), and (4) are then immediate consequences of the correspond- 


3 
(4 
(5 
(6 


oe ME ae eee st ooo ee - 























A net emenionle case 


862 B. A. LENGYEL AND M. H. STONE 


ing properties of F(A). Property (1) is a consequence of the permutability of 
F(A) with H = A — al and with every bounded linear operator which is per- 
mutable with H. Since the upper bound of H in the range of F(A — a) does not 
exceed \ — a, the upper bound of A = H + al in the range of E(A) = F(A — a) 
does not exceed A = (A — a) + a, by virtue of Theorem 1. Thus property (5) 
is established; and property (6) is proved in similar fashion. 

In showing that E(A) is uniquely determined by properties (1), (5), and (6), 
we may suppose merely that A is a linear symmetric operator. If H(A) and 
E'(d) are two projections having the indicated properties, then 


PQ) = EA)U — EQ) = UT — FO) EQ) 


is a projection with range which is the intersection of the ranges of E(d) and 
I — E'(\) and which is permutable with A: for the necessary permutability 
relations follow at once from (1). By (5) and (6), we see that, if the bounds of 
A in the range of P(A) are defined, the upper bound is not greater than \ and the 
lower bound not less than \, being unattained if it is equal to X\. These proper- 
ties are incompatible, so that the bounds of A are not defined in the range of 
P(A). Thus the range of P(A) consists of the null element alone, and P(A) = 0. 
The relation E(A) = E(A)E’(A) follows immediately. By symmetry we have 
E’(\) = E'(A)E(A) = E(A)E’(A) = EQ), as we wished to prove. 

It is now possible to prove with little difficulty the following approximation 
theorem: 

THEOREM 6. If A and E(d) have the significance indicated in Theorem 5; if the 


numbers \y = a, A1, +++» An—1, An = B, arranged in order of magnitude, determine a 
partition of the interval (a, B), where a and B are the bounds of A; if ee Be 
are arbitrary numbers such that \x-1 Sd; < rx fork =1,---,n;af 


6 = max;.i,..., n(Ak — Ag—1)3 
and 1f E(Ax) ts defined for the interval A, = (Ax—1, Ax) by the relations 
E(Ai) = E(A,),. E(Ax) = EQ) — EQi-1) fork = 2, --- , n3— 
then | Af — Dita ME (As)f | S 8 |f | for every f in &. 


It is easily seen by properties (1), (2), and (3) of Theorem 5 that the operators 
E(A;) are projections permutable with A and satisfying the relations 


E(A:)E(d,) = O for i # k, >> E(dx) = I. 
k=1 


Thus it is possible by elementary manipulations to write 


n 


Af - . ME(A)f, = >} | (A —- A, DE(A)f |. 


k=1 


a at 








"= ww 


ELEMENTARY PROOF OF SPECTRAL THEOREM 863 


The bounds of A — d;J in the range of E(A,) are now seen by virtue of properties 

(5) and (6) of Theorem 5 and by virtue of Theorem 1 to lie between — (A eng te 
/ 

and \, — 4,: When k = 2, --- , n, we have only to write 


E(Ar) = EQ) — E(du-1)) 


with the aid of property (2); and when k = 1, we use property (5) for the range 
of E(A1) = E(\,) together with the fact that the lower bound of A is \) = a. 
Of course, if H(Ax) = O for any k, the corresponding terms in the equality 
above all vanish and the bounds of A in the range of E(A,) are not defined. In 
all cases, we see that Theorem 2 now yields the result | (A — \,DE(d,)f| < 
6 | B(A,f |. Since we have >>7_, | E(A,)f |? = |f |2, we conclude that the in- 
equality stated in the theorem is valid. 

An immediate consequence of Theorem 6 is the following result: 

THEOREM 7. If A and E(\) have the significance indicated in Theorem 5, then a 
bounded linear operator T defined over % is permutable with A if and only if it is 
permutable with E(d) for every X. 

The necessity of the condition is obvious from Theorem 5, property (1). The 
sufficiency of the condition follows from the inequality of Theorem 6. If ¢ is any 
positive number, we determine a partition of the interval (a, 8) for which 6 S e 
and denote the operator }>7%_, \,E(A,) of Theorem 6 by A,.. By hypothesis 
T is permutable with A, ; and there exists a positive number vy such that | Tg | S 
y|g| for every gin %. Thus we have 


(TA — AT)f| S| T(A — Adf|+|(A — AdTS| 
<7|(A — Adf| +6|Tf| < 275 |f| < rel fs]. 


Since ¢ is arbitrary we conclude that (TA — AT)f = 0 for every f in &; in other 
words that TA = AT’, as we wished to prove. 

The inequality of Theorem 6 shows that for a sequence of partitions with 6 
tending to zero the expression >> %_; A ,H(Ax)f tends strongly to Af. From this 
fact it is easy to deduce the usual Stieltjes integral representations 


B 
(Af,g) = p Ad(E(A)f, 9); | Af |’ = MEO), f). 


We shall not consider the details, as they are sufficiently familiar from other 
treatments. 


IA 
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THE IMBEDDING OF MANIFOLDS IN FAMILIES OF ANALYTIC 
MANIFOLDS! 


By HassteR WHITNEY 
(Received February 26, 1935, revised February 10, 1936) 
I. THe THEOREM 


1. Introduction. Let fi, --- , frm be differentiable functions defined in an 
open subset of Euclidean n-space E,. At each point p at which all the f; vanish, 
let the gradients Vfi,--- , Vfs—m be independent. Then the vanishing of the f; 
determines a differentiable manifold M of dimension m. Any such manifold we 
shall say is in “regular position” in E,. Only certain manifolds are in regular 
position; see below. The purpose of the paper is to show that any m-manifold 
M in regular position in Z,, may be imbedded in an (n — m)-parameter family 
of homeomorphic analytic manifolds; these fill out a neighborhood of M in £,. 

We may extend the above definition as follows: M is in regular position if, 
roughly, there exist n — m continuous vector functions in M which, at each 
point p of M, are independent and independent of vectors determined by pairs of 
points of M near p. If M is differentiable, the two definitions agree; the Vf; are 
the required vectors. The theorem holds also for this more general class of 
manifolds. 

Some manifolds which are necessarily in regular position if they are differen- 
tiable in E,, are? 

(a) Any curve; it may have corners. 

(b) Any orientable surface. 

(c) Any orientable (n — 1)-manifold. 

(d) Any closed orientable (n — 2)-manifold. 

(e) Any manifold which is an m-cell. 

No non-orientable manifold can be in regular position, and no such manifold 
can be imbedded in an (n — m)-parameter family of manifolds. For examples 
of orientable manifolds not in regular position, see Sphere-spaces, 8(e) and (f). 

Some definitions and lemmas from a paper by the author, Differentiable Mani- 
folds, these Annals, vol. 37 (1936), pp. 645-680, will be used. We refer to this 
paper as DM. AE will mean the paper referred to in footnote 5 of DM. 





' Presented in part to the Am. Math. Soc., Sept. 4, 1934. A statement of the theorem is 
given in the Proc. Nat, Ac. of Sci., vol. 21 (1935), p. 463. 

*See H. Whitney, Sphere-spaces, Proc. Nat. Ac. of Sci., vol. 21 (1935), p. 467, 8 (d). 
(e) has been proved by W. Wazewski, Compositio Math., vol. 2 (1935), pp. 63-68. (d) has 
been proved also by H. Seifert, Math. Zeitschr., vol. 41 (1936), pp. 1-17. He proves here 
the existence of an analytic (and hence algebraic) approximation for the case of a closed 
manifold of class C?. 
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2. The independence of a set of vectors. This section will be used 
extensively in part III. With any point pis associated a vector, drawn from the 
origin to p; with this in mind, we shall use the terms point and vector inter- 
changeably (as long as we do not translate axes). Let || v || denote the length of 
v; then || ¢ — p || is the distance from p tog. If 0; is a vector with components 


Vil, +++ , Vin, then from 1, --- , ¥s we form the matrix 
Vn °° Vin 

(2.1) M =| ......-- 
Vs1 + Usn 


The square root of the sum of the squares of all s-rowed determinants D,,.... ;, 
from M we call the independence of 1, --- , vs; this is also the square root of the 
determinant of the scalar products of the v’s, by the general Lagrange identity ? 


Vi-V, + Uji-Us ; 
2 pen 
(2.2) I(y,, «++, %) = [>> D?,... , R= | ee eee ee ee 
Vs Vi Vs Vs 
I(v,;, --- , ¥s) is a continuous and symmetric function of 1, --- , vs; it is inde- 


pendent of the (rectangular) codrdinate system employed, as each 0;-0; is. 
Geometrically, it is the s-dimensional volume of the parallelopiped determined 


by v1,---, Us. It vanishes if and only if 1, ---, v, are dependent. 
Some properties of I following at once from (2.2) are: 
(2.3) I(v) = || vl, 
(2.4) I(v, + cde, 2, +++) = (n,m, +++), 
(2.5) I(cv,, 2, --:) = lo | I(, ve, +--+: ). 
Choose the x2, --- , z.-axes so that v2, --- ,v, lie in the x2, --- , z,-plane, and 
choose the z-axis so that v, lies in the 2, --- ,2.-plane. Writing out the result- 


ing matrix (2.1) and using the first expression for I in (2.2) gives, as 
lou | S || % Il, 

(2.6) I(v,, v2, +++ Us) S || 1 || L(ve, --+ , vs). 

Repeated use of this inequality gives 

(2.7) I(n, +++, 0%) S lm |] --+ [lee ll, 


an extension of Hadamard’s inequality on determinants. A similar choice of 
axes with the addition theorem for determinants gives 


IA 


(2.8) I(v, + v1, i, *** ) Ss I(v;, me 2" ) + Iv, _""" ). 
Suppose 
(2.9) W = QV; -- sae oe QsV5. 





3 See O. Schreier and E. Sperner, Analytische Geometrie I, Leipzig 1931, pp. 113-114. 
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By (2.4) and (2.5), 
I(w, Vea, *** Vs) = I(ai;, Ve, +++, Vs) — | ay | I(v, vee, vs). 
Hence, using (2.6), of I(v1, --- , vs) ¥ 0, 


I(v., see » Vs) 
I(0, sai , 0%)’ 


with a similar inequality for each other | a; |. In (2.8), replace v, andv, + v, by 
v, and w;; we obtain with (2.6) 


I(wi, 2, +++) S L(vi, m2, ++) + |] wi — || L(r, --- ). 
A similar inequality with v,; and w, interchanged gives, with this, 
(2.11) | L(wi, 2, --- ) — Tri, 2, ---) | S || wr — m || Ze, --- ). 
An s-fold application gives, using (2.7), if the v; and w; are unit vectors, 
(2.12) | I(wi,--+, We) — T(r, ---, ve) | S |] wr — mi] +--+ + || we — 2, II. 


A vector function v(p) defined in a point set M is of Class C’ or analytic if each 
of its components is. It satisfies a local Lipschitz condition in M if there exists a 
positive continuous function K(p) in M such that if p and q are in M, then 


(2.13) || oq) — o(p) || S K(p) |la — p|| (Ilq — p|| < 1/K(p)). 


For this to hold, it is evidently necessary and sufficient that each component of 
v(p) satisfy a local Lipschitz condition. 





(2.10) lai] < || w]| 


3. Manifolds in regular position in Z,. Let M be a point set in Euclidean 
n-space E,. We shall say M is an m-manifold in regular position in E,, if the 
following is true: 

(1) Each point p of M is in a neighborhood U of pin M which is an m-cell.* 

(2) There are n — m continuous unit vector functions v:(p), --- , Un—m(p) 
and a positive continuous function p(p) < 1 defined in M, such that if p and g 
are in M and || g — p || < p(p), then 


(3.1) I[v; (p), +++ 5 Mn—m(p), 9 — p] = e(p) Ila — Pl. 


If there is a limit set LM (see DM §2), we take p(p) less than the distance 
from p to LM. 

(3.1) says that for a certain « > 0, the vector g — p makes an angle 2 e with 
the (n — m)-plane determined by 1(p),---, Un—m(p). From (3.1) we find, 
using (2.6), 


(3.2) I{v; (p), Pisit Un—m(P)| 2 p(p), 


and thus 1(p), «++ , Wn—m(p) are linearly independent. 





* That is, U is homeomorphic with the set of points Y + ae x? < 1in m-space. 
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If M is of class C*(r = 1), an equivalent formulation of (2) is evidently: If » 
is any unit vector parallel to M at p, then 


(3.3) I{vi(p), «++ , Ur—m(p), »| = p(p). 


In other words, the tangent plane to M at p and the plane of »,(p), --- , dn -m(p) 
are independent. 

Let M and M* be manifolds of class C"(r 2 0), and let n(p) be a positive 
continuous function defined in M. We say M* approximates to M through the 
r‘ order with an error < (p) if M and M* are homeomorphic and the following 
is true: 

(A) If pand p* are corresponding points of M and M*, then || p* — p || < n(p). 

(B) If r > 0, let 7 be the m-plane tangent to Mat p, and let x;, --- , 2, bea 
set of rectangular axes such that 7 is the z,, --- ,z,-plane. Let 


, , , ! *k, ? , ‘ ‘ 
Be = IhS1, +++ 5 Zea) Zz; = J; (e1, --+ » Su) (@=m-+1,---,n) 


be functions (of class C’) determining a neighborhood of p in M and a neighbor- 
hood of the corresponding point p* of M*. Then if qg and q* are the projections 
of p and p* into T, 


(3.4) | Dif (Q") — Dif(q) | < n(p) (ox Sr;i=m+1,---,n). 


For the notation, see DM §6, or AE. 

THEeorEM I. Let M be an m-manifold of class C*(r = 0 and finite) in regular 
position in E,,, and let n(p) be any positive continuous function definedin M. Then 
M can be imbedded in an (n — m)-parameter family of manifolds 


(3.5) M(c1, --+ , Cn—m), les| <1 (¢ = 1,---,n — m) 


with the following properties: 

(1) M(O, --- ,0) = M. 

(2) M(a, --+ , €n—m) ts analytic if (C1, «++ , Cn—m) ¥ (0, --- , 0). 

(3) Hach manifold M(c,, --- , Cn—-m) approximates to M through the r order 
with an error < (p). 


(4) If 
(3.6) p* = Z(p; cr, «++ » Cn—m) 


is the point of M(c,, --+ , Cn—m) corresponding to the point p of M, then Z 1s a con- 
tinuous function of its arguments, and the points p* fill out an open set R containing 
M in a one-one way; hence no two of the manifolds intersect. The points p* for p 
fixed fill out a portion of an (n — m)-plane through p. 

(5) If the sets (c\', --- , c2,)(i = 1,2, ---) approach the set (c1, --+ , Cn—m); 
then the manifolds M(c\'?, ..- , c\,,) approach the manifold M(a, +++ » Cn—m); 
together with partial derivatives of order S r. 

(6) If M is of class C® and has a boundary“ M’, then we can make higher and 





* Boundary shall thrcughout mean limit set. 
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higher partial derivatives of M(c:,--+ , Cn—m) approximate to the corresponding 
derivatives of M as we approach M’. 


II. THe Proor, M DirrerentTIABLE 

4. The vector functions w;(p). Set s = n — m. The vector functions 
n(p), +++ » ¥e(p) defined in M may not satisfy a local Lipschitz condition; we 
shall replace them by unit vector functions wi(p), --- , w.(p) which do, and will 
in a certain sense be analytic. 

Let R, be an open set containing M but no boundary points of M. (For 
instance, Ri: = all points whose distances from M are less than half their dis- 
tances to the boundary M’ of M if M’ exists; otherwise, R; = E,.) Extend 
each component »;,;(p) of each v;(p) so it will be continuous in R;.5 Set ¢(p) = 
o(p)/(4ns) and extend it so it will be positive and continuous in Ri. Define the 
analytic functions w; }(p) in R, by AE Lemma 6 (DM Lemma 9) 1 so that 


(4.1) | wi(p) — v4(p) | < £(p) in Ry. 
Now for any pin M, 
|] 5 () I — 11 = I Il sm) I — Uo) || S | wip) — 0) || 
< x | w;(p) — r4(p) | < né(p) < 1, 

and hence || w,;(p) || + 0 in a neighborhood R: of M. Define the unit vector 
function w,(p) = w;(p)/|| w;(p) ||. Then 
I| w; || 

|| wi(p) — vi(p) || S ne(p) + ng(p) = p(p)/(2s) in M. 
Hence, using (2.12), if u is a unit vector, 


(1 — || w; ||) + (ws — a), 


Oz—- 0: = 


(4.2) | I{wi(p), laa w(p), u] “y I{v(p), aa ae v.(p), uJ | Ss 3p(p), 
which shows that (3.3) holds with p(p) replaced by 3p(p). Therefore M is in 
regular position in terms of w,---, ws. As the first partial derivatives of the 


w,; are bounded in a neighborhood of any point of Re, the law of the mean shows 
that each w;; and hence each w; satisfies a local Lipschitz condition in Rz and 
hence in M: There is a continuous function K(p) = 1 in M such that 


(4.3) || we(q) — wi(p) || S K() Ila — Pll (lla — pll < 1/K(p)). 


5. The functions ¢;(p). Given any point p of M and any set of numbers 
M1, +++ ,as(s = n — m), define the point p* by 


(5.1) p* =p + > ai wi(p) = G(p, ase? » Ge). 


i=1 


We wish to solve this equation for p and the a; in terms of p*. 





* Almost any method of extending continuous functions can be used to do this. 
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Lemma 1. There is an open set R3 containing M, and a vector function H and 
functions ¢;, all of class C*, 


(5.2) p = H(p*), ai = oi(p*) (¢ =1,---,8) 
defined in Rz, such that 

(5.3) H{p + > aiwip)] = p, dlp + Do aiwi(p)] = a 

if the point in brackets is in Rs, and conversely, if p* is in Rs, 

(5.4) H(p*) + Do ¢i(p*)wilH (p*)] = p*. 


This follows at once from the proof of DM Lemma 21. 

Lemma 2. Let T’(p) be the s-plane through p in M determined by wi(p), ---, 
w.(p), letx,, «++ , x, be rectangular axes chosen so that T’(p) is the ae a 
plane with p at the origin, and with x; in the wi(p)-direction, ay in the plane of 
wi(p) and we(p), etc. Then of di(a1, ae In) = ; (x1, ek i P), 


yt = Abr) = 5 bs) 


(x1, eels. 2;) 


If we add hw;(p) to both sides of (5.1), we obtain 





(5.5) ~ 0. 


p* + hwx(p) =pt+ yy (a; + 5ih)wi(p). 
i=l 


This, with (5.3), gives 


(5.6) oi [p* + hux(p)|] = oi(p*) + bixh, if p = H(p*). 
From this it follows that | 
(5.7) Voi-wr(p) = bx, 

OO: — 0 (k <1) and ¥ O(k = 9), 


Xp 


and hence J’ # 0. 

Lemma 3. Let T(p) be the tangent plane at p in M, and let %1, --+ , En be rec- 
tangular axes chosen so that T(p) is the %,--+ , Em-plane, with p at the origin. 
Then tf oi(a1, «++ Xn) = bili, «++ ,En5 7D), 





(5.8) J\, = ae af). : ~ 0. 
By the last lemma (using 2,, --- , 2, in place of a, --- , 2a) 
(5.9) (Voi, --- , Vos) # Oon M. 
Now express J in terms of the %,, --- , Zn, codrdinate system. As 0¢;/8%; = 


O(j < m) at p, all determinants in the new expression except possibly the deter- 
minant in (5.8) vanish; hence this determinant in absolute value equals J, and 
(5.8) is proved. 
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6. Two lemmas on implicit functions. We shall use here the symbols c,, 
D, if nis replaced by m. 
Lemma 4. Let the functions 


P;(21, «++ , Xs) (¢ = 1,---,8) 


be defined and of class C' in a neighborhood V of the origin qo, let them vanish at qo, 
and let the Jacobian be ¥ 0 there. Then for any « > 0 there is a 5 > O with the 
following property: If ®,, --- , @. are defined and of class C! in V, and 


00; _ 09; 
02; 02x; 





(6.1) | 0; id G; | < 6, 








<6 G,j=1,---,8) 


in V, then the Jacobian of the 0; is ¥ 0, and there is a unique point q* in V such 
that 
(6.2) lla* — gl < «, 6:(q*) = 0 (@ = 1,---,8). 


Moreover, 5 can be made continuous in the ®;, their derivatives, and €; the point q* 
is continuous in the 0;. 
Lemma 5. Sets=n—m. Let the functions 


@,(21, --- , Ln) (¢ = 1,---, 8) 
be defined and of class C’ (r = 1) in V, the set of all points within a distance p of the 
origin po, let them vanish at po, and let 


a(h,, ees , B) 


= ~ 0. 
8(2m+1, seh In) Po 





Po 


Then for every « > 0 there is a6 > O with the following property: If Q:, --- , Os 
are defined and of class C* in V, if 


(6.3) |DiO(p) — Di®i(p)| <6 Or Sr;t=1,---,8) 
inV, if p* = (at, «++ , 2%) and 
(6.4) Il p* — poll < 4, 8:(p*) = 0 (j= 1,---,8), 
and if 

Vili, +++, Lm), fi(t1, +++ » Lm) (¢ =1,---, 8) 
are continuous functions defined for xi + --- + 2, < & such that 
(6.5) ¥s(O, +++, 0) = 0, fatty +++ 4 Bm) = mer 
(6.6) @(t1, +++, Em, V1, °°, Vs) = 0, 
(6.7) Oi(a1, --- » tm,Jiy°°° fe) = 9, 


Mar, +++ 2m for i =1,---,8, then they; and the f; are of class C" and 





** Which we take sufficiently small. 
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(6.8) | DifAzi, ra , Zn) —_ DiwWA0, deiba , 0) | <e€ (o; < r) 


fori =1,---,s. Moreover, 6 is continuous in p, €, and the ®; and their derivatives, 
These lemmas are forms of the implicit function theorem, with uniformity and 
continuity conditions. They may be proved by standard methods. 


7. Proof of Theorem I, r 2 1. Given any point p, of M, we may choose 
axes and define the functions $;(%1, --- , Zn; po) as in Lemma 3. Moreover, we 
may evidently choose such axes in a continuous manner for each point p of a 
whole neighborhood U of pp in M; the ¢; will then be continuous in 2, --- , z, 
and p for (a1, --- , tm) near (0,--- , 0) and pin U. Lemma 3 shows that the 
conditions of Lemma 5 are satisfied; the number p may obviously be taken 
continuous in U. Hence, covering M by a set of neighborhoods U, we see that 
there is a positive continuous function (p) < n(p) in M such that if (6.3) 
through (6.7) hold with ®; and 6 replaced by ¢; and 91(po), then (6.8) holds with 
e replaced by n(po). 

Next, define T’(po) and axes x, --- , 2, in a continuous manner for any p in a 
neighborhood U of po as in Lemma 2. For points of T’(po), the functions ¢, 
are then 


(7.1) ¢; (21, pam » Bes 0, a 0; Po) as (24, vam A »%e3Pes 


are defined in a neighborhood of the origin, and are continuous in #;, --- ,Z., Dp: 
As J’ ¥ 0, the conditions of Lemma 4 are satisfied. Hence there is a positive 
continuous function Q(p) < (p) in M such that if (6.1) holds for axes and 
functions corresponding to po in M and with 6 replaced by (po), then (6.2) 
holds with ¢ replaced by 21(pp). 

Next, set Q2(p) = %(H(p)) in R3. Set 


(7.2) Q(p) = %(p)/(2 N,), 


where N, is a number such that if F and F’ are the expressions for any function 
of class C* in terms of any two sets of rectangular axes, and 


(7.3) |\D.F| Sax S71), then | D, F’ | S N,a (ox S 7). 


Let 2’(p) be the minimum of Q(p) and the distance from p to M. Then 2'(p) 
is continuous in Rs, zero in M, and positive in R; — M. 
By DM Lemma 9 there are functions ¢; (p) analytic in R; — M and such that 


(7.4) | Dei (p) — D,¥;(p) | < 2"(p) in Rs (ox Sr;i=1,---,§); 


or at least in R; — M. If we set ¢;(p) = $:(p) in M, then ¢; (p) is of class C’ 
in Rs, and (7.4) holds in M also, by DM Lemma 10. By DM Lemma 25 there 
is a positive analytic function w(p) in R; such that 


(7.5) | Dew(p) | < Q(p) in Rg (o. S17). 


Take any set of numbers 


fol 


tio 


of : 
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(7.6) C = (C1, +++, Cs), le:| <1 (¢=1,---,8). 

Corresponding to this set we define in R; 

(7.7) 0,(p;¢) = $;(p) — c,«(p) (§ = 1,---,8); 

these functions are analytic in R; — M. (7.4), (7.5) and (7.6) show that 

(7.8) | DiO(p;c) — Dedi(p) | S 2 Q(p) ( Sr;t=1,---,8) 

in R;. The manifold M(c) is defined as the set of points p in R; for which 

(7.9) 9i(p;c) = 0 (@=1,---, 8). 
Given p in M and T’(p), choose axes x,, --- , 2, as before, and consider the 


functions @,(z,, «++ , 2,), the values of the 6;(p; c) on T’(p). (7.2), (7.3) and 
(7.8) show that (6.1) is satisfied with 6 replaced by %(p) ; hence there is a unique 
point 

(7.10) p* = Z(p; °c) 


in T’(p) within ,(p) of p at which (7.8) is satisfied. Lemma 7 shows also that 
p* moves continuously as we move pin M. If we let p* in M(c) correspond to p 
in M, Lemma 1 shows that M(c) and M are homeomorphic. 

Evidently the function Z is continuous in both p and c. No two of the mani- 
folds M(c), M(c’)(c ¥ c’) have a common point, as follows at once from their 
definition. Given any point (7.10), the corresponding c is given by: 

cs = $;(p*)/w(p*). 
As ¢; and w are continuous and R; is open, to each p’ near p* there corresponds a 
c',|¢;| <1. Hence the points (7.10) fill out a neighborhood R of M. 

Take pp in M, a small neighborhood U of pp in M, and a fixed c. For each p 
in U there is a p* = Z(p;c); using the axes of Lemma 3, these points p* are given 
by functions f; as in (6.5). As O.(p*) = 6:(p*; c) = 0 at these points, (6.7) 
is satisfied. Setting c = 0 shows that (6.6) is also. By (7.8) and (7.3), and as 
\|Z(p; c) — p|| < &(p), we may apply Lemma 5; (6.8) now completes the 
proof of (3) in Theorem I. Finally, the M(c) are analytic for c * 0, as the 
9(p; c) are analytic in Rs — M. (5) is obviously true. If r = © and we wish 
to have (6) satisfied, we use AE Lemma 6 with m = ~. Theorem I forr 2 1 
is now completely proved. 


III. Toe Proor, M not DirreRENTIABLE 


If M is not differentiable, the functions ¢; will be continuous, but not differ- 
entiable. However, they will have directional derivatives in the directions of the 
w;; we may approximate to the ¢; by analytic functions having similar direc- 
tional derivatives. To show this, we first need some preliminary lemmas. 


8. Averages of a function. Let S(p, \) denote the set of points q within \ 
of p, let S’(p, 4) denote its boundary, let V(A) be the volume of S(g, )), and let 
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V’(A) be the (n — 1)-volume of S’(p, d). If ¢(p) is a function continuous in the 
open set R, we may define the average 


8.1) AQ@,») = ahs I 60) de 


Similarly we define B(p, d), the average of ¢ over S’(p, 4). A simple computa- 
tion (noting that V’(A) = aV(A)/dA) shows that A(p, d) has a continuous deriva- 
tive with respect to d, given by 


(8.2) = Ap, ») = S1B(p, ») — A(p, d)]. 


Also, A(p, d), as a function of p, has continuous first partial derivatives. (In 
fact, the directional derivative of A(p, d) in the direction of the vector v is given 
by taking the formula for B(p, d), and putting in the integrand the factor cos 6, 
where @ is the angle between g — p and v.) 


9. Approximation to a certain continuous function by an analytic function. 
We prove here 

Lemma 6. Let $(p) be a continuous function defined in the open set R, let 
Vi(p), --+ , Ws(p) be continuous in R, and let w(p), --- , ws(p) be continuous unit 
vector functions in R. Let there be a continuous function p(p) = 1 in R such that 
uf p and q are in R and 


q=p+hlw(p) + y], 


(9.1) 
lla — pll < 1/u(p), |h| < 2/u(p), 
then 
(9.2) $0) = $9) _ yép)| ¢ w(p) Il all 
fort =1,---,8. Then for any positive continuous function ¢(p) in R there is an 
analytic function ¢'’(p) in R such that 
(9.3) | ¢’"(p) — o(p) | < ¢(p), | Vo""(p)-wi(p) — wip) | < S(p) 


for any pin Randi = 1,---,s.8 

It is sufficient to find a differentiable function ¢*(p) such that (9.3) holds with 
¢(p) replaced by 3¢(p); we can then at once find the required analytic function 
by DM Lemma 9. It may be shown that the analytic approximation to ¢(p) 
given in the proof of AE Lemma 6 has the required property, but this is not very 
simple. 

We show first that ¢(p) satisfies a local Lipschitz condition. Let Ii(p) 2 1 
be a positive continuous function in R such that 


° The lemma evidently holds still if we add } ¢(p) to the right hand side of (9.2). 





2 Piha rabies 





Take 


then 
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Take p and gin R, ||¢g — p|| < 1/u(p). Set 


h=2\la—pll, pt=pthui(p), 1=02-, 





q = p* + hn = p + hlwi(p) + ol. 


bo] 


llnll $F llla— ll + llp* — pill = 


Hence, by (9.2), #f ||g — p || < 1/u(p), then 
9.5) | ¢(g) — $(p) | < 2[Li(p) + 3u(p)/21 Ig — p || = La(p) Ila — pli. 
Take pin R and \ < 1/y(p), and define A(p, d) by (8.1). (9.5) gives 


1 
08) [A — 60)1 < ay [ oy NaC) da = Nal), 


with a similar inequality for | B(p, 4) — ¢(p) |. Hence, by (8.2), af we keep 
\< 1/u(p), then 


(9.7) 3 A(p, d)| < 2nLe(p). 





Let ¢’(p) be a positive continuous function in R so small that 
(8) ) <1/ulp), () < 5(p)/[8n*La(0)], 
and if || ¢ — p || < ¢’(p), then 

|| wi(g) — wilp) || < $(p)/[16 u(p)], 
| vq) — vip) | < S(p)/8, — w(Q) < 2u(p), 


fori = 1,---, 8s, andif ||q’ — p|| < 2¢'(p), theng’ isin R. Let A(p) be any 
positive differentiable function in R such that 


(9.9) 








(9.10) | A(p) |, ~ < ¢'(p) (j= 1,---,n) 
(see DM Lemma 25). We define ¢*(p) by 
(9.11) ¢*(p) = A(p, A(p)). 


To prove (9.3) with ¢(p) replaced by 3 ¢(p), take any 7 and any p in R. Take 
any h such that |h| < ¢’(q) for ||q — p|| < Mp), and set & = hw,(p). Take 
any g, ||q — p|| < d(p), and set » = w.(p) — w:(q); then g and q + éare in R, 
and 


gq+§= 44 Afwscq) + a. 


$~ — een erp eer? 
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Applying (9.2) and (9.9) gives 





o(g + . — (9) _ vil) | < HOg) Sp) $C) & $0), 





16 u(p) 8 4 
Choose axes 21, ---, z, so that w;(p) lies in the z,-direction. Knowing that 
A(p, d) is differentiable, the last inequality gives (keeping \ fixed) 
(9.12) | 2, A(p, d) —-vilp) | < ip), 
| ax, 4 
Hence 
ag*(p) ng(p) , S(p) _ §(p) 
| oO vi(p) | < 2n Tal) aT) + 4 2” 


proving the second inequality of (9.3). The first follows from (9.6), (9.10) and 
(9.8). The lemma is now proved. 


10. The functions ¢;(p). Given p in M, let p’(p) be the smaller of the two 
numbers p(p), 1/K(p) (see (4.3)), and take p’’(p) S 3 p’(p) positive and con- 
tinuous in M so that if || q — p || < p’’(p), then 


(10.1) 3 p'(p) < p'(q) < 2p'(p), = 3 w'(p) < p(Q) < 2p(p). 
Let T*(p) be the set of points (5.1), with 
(10.2) |as| < p’’(p)/[3 s K(p)). 


Lemma 7. Set K’(p) = 12/p(p) in M. If p* and q* are in T*(p) and T*(q) 
respectively and || q* — p* || < 3 p’’(p), then 
(10.3) lla—pll = Kp) ll * — p* ll. 

Say q* =q+28w,(q). Subtracting (5.1) from this and using (10.2) gives 


p’'(p) + sp’(p) , sp’’(q) 


la Pil< 3° + 3eR@ + 3K)’ 








We show now that 
(10.4) lla — p|| < min [p’(p), p’(q)]. 
According as p’’(q) S p’’(p) or > p’’(p), we find 
lla — pil < e'(p) S 3 e'(p) < o'@), 
lla — pil < p’(q) S 30) < p'(p). 
Set 
(10.5) A; = wi(q) — wi(p); 
then 


q—-p=q — pt+ Dd (ai — Biwi(g) — Do adi. 


Appl 


(11. 
Tak 


(11. 


As | 
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Applying (2.4), (2.7), (4.3) and (10.2) gives 
I= I[wi(q), <4 ,» w.(q), ;- p| 
= I[wi(q), --- , ws(q), g* — p* — pk aiA,] 


Slat — pti + iq - pI. 


Also, by (3.1) with v; and p replaced by w; and 3 p, and (10.1), 


I 2 \\¢a — pil 4 (@) > Ila — pil i e(p). 


That we may apply (4.3) and (3.1) follows from (10.4). The last two inequali- 
ties give (10.3). 


Lema 8. The sets T*(p) fill out a neighborhood R’ of M ina 1-1 way. 

By the last lemma, no two sets T*(p), T*(q) (q ¥ p) haveacommon point. To 
show that R’ is an open set, take p» in M, and let U be a neighborhood of 7» in M 
which is an m-cell. Map U into a portion of E, with its last n — m coérdinates 
zro. If pin U is mapped into p’ in E,, take any p* of (5.1) and map it into the 
point with the same first m coérdinates as p’ and with ae, --- , a, as its last 
coérdinates. The resulting region S’ of E, is homeomorphic with the set S of 
points on T7*(p), pin U. By the theorem on invariance of regions, S is a region. 
R’ is the sum of such sets S, and hence is open. 

We may now define the functions ¢; and H as in Lemma 1, and they will be 
single valued and continuous in R’. 
11. The difference quotients of the ¢;(p). We first extend the definition 
of the wz(p) over R’ by setting w,(p) = wx(H(p)). 

Lemma 9. The functions o(p) = ¢:(p), ¥i(p) = bx, wi(p)(t = 1,---, 8) 
satisfy the conditions of Lemma 6 for each k = 1, --- , sand an open subset R of R’ 
containing M. . 

Let R be the set formed by keeping a; less than one half the right hand side of 
(10.2). Let u(p) be a positive continuous function in M so large that 


(11.1) u(p) > 9/p"(p), — 3[K'(p) K(p) + K'(p) + 1/e(), 


and if u(p*) = u(H(p*)) in R and || p’ — p* || < 2/u(p*), p* in R, then p’ is in 
R’. Now take any p* and q* in R and any h so that 


(11.2) llq* — p* || < 1/u(p*), = | h| < 2/u(p"). 

Take a fixed 7, set p = H(p*), q = H(q*), and define a;, 8;, n, p’ by 
pt =pt+ dLawi(p), gt =¢+ dL (ai + Biwi), 
qt = p* + hlwi(p) + 1), =p’ = p* + hwi(p). 

As || p’ — p* || < 2/u(p*), p’isin R’. By (11.3) and (11.1), 


(11.3) 
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(11.4) lla* — p’ || S |la* — p* ll + 1A] < 3/u(p*) < 3 0(p); 
hence we may apply (10.3), giving, with (11.1), 


(11.5) = |la—pll < K’(p) || a* — p’ ll < 3 K(p)/u(p) < 1/K(p). 
This shows that we may apply (4.3), giving with (10.5) and (11.5), 
(11.6) = || A; || S K(p) K’(p) || a* — pv’ Il = K(p) Kp) [hl Il al, 
as q* — p’ = hy. From (11.3) we find 
q — p = hwi(p) + hn — Do (aj + BDA; — DB j0;(P). 

Hence, by (11.5) and (11.6), 

l| Do (8; — dssh)w,(p) || < [K’(p) + 1 + K(p) K"(p)] |h| Il al. 
Dividing by | h | and applying (2.10), (2.7), and (3.2) with v; and p replaced by 
w; and 3 p gives, with (11.1), 
(11.7) [| Bi/h — dix || < w(p) Il a II (kK =1,---,8). 
As By = $x(q*) — ox(p*) and u(p) = u(p*), (9.2) is proved. 

12. Proof of TheoremI,r = 0. The proof follows the lines of §7, except that 
we do not use Lemmas 3 and 5. Given a point »» of M, we may obviously take 
a neighborhood U of p» and choose axes x1, --- , x, as in Lemma 2 in a continu- 
ous manner for pin U. For each pin U we have functions (7.1) satisfying the 
conditions of Lemma 4, (by Lemma 2, which holds even if r = 0). Take 


%(p) < n(p) so that if (6.1) holds with 6 replaced by 9(p), then (6.2) holds with 
e replaced by n(p). Set Q(p) = M(p)/(2n), and define 2’(p) as before. 


By lemmas 9 and 6 we may define the functions ¢; (x1, --+ , tn) in Rs — M so 
that, setting ¢; = ¢; = Oin M, 
(12.1) | ¢i(p) — oi(p) |, | Vex (p)-we(p) — du | S 2'(p) 


in R;; the ¢, are continuous in R;. We now define w(p) and the 6;(p; c) by 
(7.5) and (7.7). (7.8) is replaced by 


(12.2) |0:(p;c) — o:(p) |, — | VOs(p; c) - wip) — 5x | S 2 Q(p) 
in Rs. M(c) is defined by (7.9). The rest of the proof goes as before. 
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ON THE PROJECTIVE GEOMETRY OF PATHS 
By L. Berwaup 


(Received March 7, 1936) 


Introduction. The present paper arose from the attempt to understand, 
purely from the standpoint of the geometry of paths, (that is, without going back 
to the “tangent spaces” of the underlying manifold) the projective theories of 
T. Y. Thomas! and O. Veblen,? and especially the introduction of the additional 
coérdinate x° in these theories. Naturally it is only the particular case of a 
symmetric projective connection with an uniquely determined system of paths* 
which had to be considered. In that attempt I was led to a new conception 
of a projective geometry of paths which is a generalization of the usual notion. 
This new conception is explained in §1. Besides the invariance under projective 
changes of the affine connection, a particular parameter on the paths, called the 
projective normal parameter, here plays a principal part. In §2 I give a first 
mathematical formulation of the fundamental idea. The projective normal 
parameter is defined by means of a Schwarzian differential equation up to linear 
fractional transformations. It is this Schwarzian equation which is funda- 
mental for the following work. Then I show (§3) that Veblen’s gauge variable’ 
is connected with the transition from the affine parameter on the paths to the 
projective normal parameter, and I consider the place of Veblen’s transformations 
of the representation® in the projective geometry of paths. In §4 the generaliza- 
tion of the Weyl projective curvature tensor’ for an arbitrary projective geom- 





1T. Y. Thomas, On the projective and equi-projective geometries of paths. Proc. N. A. 8. 
11 (1925), pp. 199-203; A projective theory of affinely connected manifolds. Math. Zeitschr. 25 
(1926), pp. 723-33. See also T. Y. Thomas, The differential invariants of generalized spaces. 
Cambridge University Press 1934, X + 241 pp. This book will be referred to as D. I. 

*0. Veblen, Generalized projective geometry. Journal London Math. Soc. 4 (1929), pp. 
140-60; Projektive Relativitatstheorie. J. Springer, Berlin, 1933, V + 73 pp. This book 
will be referred to as P. R. 

*O. Veblen, P. R., pp. 29-30. 

‘ This parameter was introduced by J. H. C. Whitehead, The representation of projective 
spaces. Ann. of Math. 32 (1931), pp. 327-60; see especially §3. An analogous parameter 
on the geodesics of a general projectively connected manifold was used by Enea Bortolotti, 
Sulle connessioni proiettive. Rend. Circ. Mat. Palermo 56 (1932), pp. 1-57, especially pp. 
30-31. For a two-dimensional Finsler space the preferred projective normal parameter 
(see §9) was introduced by Funk under the name of “linearisierender Parameter.’’ See 
P. Funk, Uber zweidimensionale Finslersche Réume, insbesondere tiber solche mit geradlinigen 
Extremalen und positiver konstanter Kriimmung. Math. Zeitschr. 40 (1935), pp. 86-93. 
Here the Schwarzian differential equation appears which is fundamental for my view of the 
subject. 

*P.R., p. 10. 

°P.R., p. 10. 

"H. Weyl, Zur Infinitesimalgeometrie: Einordnung der projektiven und der konformen 
Auffassung. Gdttinger Nachr. 1921, pp. 99-112, especially p. 101. 
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etry of paths is given. §5 is devoted to an important particular case in which 
the projective normal parameter is already determined by the differential equa- 
tions of the paths alone. In §6 a second method of treating a projective geom- 
etry of paths is given which corresponds to the theory of T. Y. Thomas. This 
method leads very naturally to the generalization of the Weyl curvature tensor, 
found in §4 (§7). In §8 I discuss briefly the equivalence problem for two 
projective geometries of paths. 

Paragraphs 2-8 deal with the restricted geometry of paths in the sense of J. 
Douglas.* In §9 I explain, as concisely as possible, the corresponding ideas 
for the general geometry of paths. 

As far as possible, the notations are choosen in conformity with the existing 
literature, at the risk of being not always quite consistent. When I refer to a 
function without further qualification, I shall mean an analytic function regular 
at all points (or line-elements®) under consideration. Repeated indices indicate 
summation. 


1. Projective geometry of paths. The conception of a projective geometry 
of paths which we shall develop in the present paper rests on two notions: 
the notion of the class of affine connections belonging to a system of paths and the 
notion of a projective normal parameter of a system of paths. The class of affine 
connections belonging to a system of paths consists of all symmetric affine connec- 
tions for which the system of paths is identical with the system of autoparallel 
curves. Any connection of the class is derived from any other connection of 
the class by a projective change of the affine connection.’ 

A projective normal parameter of a system of paths is a parameter which has 
the following properties: 

(a) It is determined, up to an arbitrary linear fractional transformation, on 
all paths of the system at the same time. 

(b) It is not altered by transformations of coérdinates. 

(c) It remains the same for all affine connections of the class belonging to the 
system of paths. 

By every one of the ~* possible choices of a projective normal parameter, a 
projective scale is fixed on all paths of the system at the same time. 

By a projective geometry of paths we understand the simultaneous theory of 
the class of affine connections belonging to a system of paths and of a projective 
normal parameter of this system. 

In the preceding statements, we can understand the notion of a system of 
paths in the restricted or in the general sense." Corresponding to this, we must 
understand the notion of the affine connection in the restricted or in the general 
sense. An affine connection is called restricted if its components are point func- 
tions, general if they are functions of the line-element.” 





§ J. Douglas, The general geometry of paths. Ann. of Math. (2) 29 (1928), pp. 143-68. 
9 In §9. 

10H. Weyl, l.c., pp. 99-100. 

1 Cf. J. Douglas, |.c., pp. 144. 

12 See §9. 
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9. Restricted projective geometry of paths. We consider an n-dimensional 
manifold Mt referred to a definite coérdinate system (z‘). By a path (in 
the restricted sense") we denote a curve x‘ = z‘(t) given as solution of a system 
of differential equations 


dx* { d?z! ; dz™ =) dx! (‘= , dx! dx* 

co Ep ie ice Ml backed D> Bp 
2) a (3 TOP me Ge at) ~ ae \ae tl aa 
Here the I'5,, are given point functions symmetric in the lower indices: 
(2.2) oh = A je 


On introducing a new parameter s by means of the substitution 











d*s = d*x ; dx dx 

0s) eet ae 

) ds — 
dt dt 





we can write the differential equations (2.1) in the form 


Oe 5: ‘ dx’ dx* 
ds? /* ds ds 


sis called an affine parameter of the system of paths. It is determined up to 
entire linear transformations. . 

The equations (2.1) remain unaltered if we put, in the place of the functions 
l';,, the new functions 


(2.5) PM. = Th, — Wi dk — ve 5} 
where the y; are arbitrary point functions and the 6; Kronecker symbols. To 


the functions I’, corresponds a new affine parameter §. If we introduce $ as 
independent variable, the differential equations of the paths take the form 


Prt a; daridz* _ 


t 


ds? i* ag dé 


(2.4) = 0. 


0. 





(2.6) 


On the other hand, the equations (2.4) change” to 


Pai = 8" dat ; dx dx* 
(2.7 sin oe i, OA oe ee 
; de + @ ds ** as as 
by the introduction of § = §(s), and these equations must be identical with (2.6). 
With regard to (2.5), this gives 





(2.8) ds _ eS vided, 

ds 
When we transform the coérdinates x‘ by any analytical transformation 
(2.9) z= Zz (x1, x, +: x") 





8 J. Douglas, l.c., p. 144. 
‘The dashes signify differential coefficients: 8’ = d8/ds, s’’ = d?8/ds°. 
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whose functional determinant is different from zero for all points under con- 
sideration, the 5, transform according to the equations 


=: _ {as 92” dx" ax? \ az! 
They are, therefore, the components of a (symmetric) affine connection I. 
The same is true for the ,- Consequently, the y; in (2.5) are the components 
of a covariant vector. The transition from the I}, to the fi, according to 
(2.5) is called a projective change of the affine connection T.% The class K of 
affine connections belonging to the system (2.1) of paths (§1) consists of all affine 
connections given by (2.1), (2.2) and (2.5). 
Now we define a projective normal parameter x on the paths (2.1) by means 
of a differential equation of the form 


= o dx dx* 
(2.11) {r,s} = — 2MT;. = 
In (2.11), {v, w} is the Schwarzian derivative 
{v, u} = - sre 
’ ai dv 2 dv ’ 
du du 


M is a constant not zero and the I’, are given point functions symmetric in 
the lower indices: 


(2.12) r3, = Th;- 


By (2.11) z is defined as function of s up to linear fractional transformations. 

We demand that the parameter 7 remain unaltered by transformations of 
coérdinates and by arbitrary projective changes of the affine connection I 
(or, what amounts to the same, by the introduction of another affine parameter 
$ on the paths). From the first of this postulates, it follows that the I'$, are 
transformed by transformations of codrdinates according to 


=0 _ wO0 aa! da™ 
(2.13) lin = Dim Sai agF? 
because the affine parameter s remains unaltered (up to entire linear transforma- 
tions) by the transformations (2.9) of codrdinates. The I$, are, therefore, 
the components of a covariant tensor of rank 2. Moreover, by putting 


(2.14) tr, } = —omMTe, 


ik ae dé’ (f, = 1;) 





15H. Weyl, l.c., pp. 99-100. 
16 Without loss of generality, we could put M = 1. We do not particularise M with 
regard to later considerations (§6). 
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and by taking notice of 
ds\? 
(2:15) (n,8) = (SY te, 0} — (4,911, 


we find from the second postulate by means of (2.8), 


(2.16) iM. =Th. - af: (™ + an) _ eT, + Yj vn 


oak 


(2.16) are the equations of transformation of the I'}, by a projective change 
(2.5) of the affine connection I. 

Hence, a projective geometry of paths in IN (in the restricted sense) according 
to the definition of §1 can be fixed by giving n(n + 1)? point functions T',(x), 
T(x), symmetric in the lower indices and satisfying the laws of transformations 
(2.10), (2.5), (2.13), (2.16). 

If we perform successively, in an arbitrary order, a projective change (2.5) 
of the affine connection and a transformation (2.9) of codrdinates the T'},, I'S, 
change respectively to 


et m r 2 al yi l : 
(2.17) fi,= (r; i ll oe) e —v (= si + a 04), 


™ oxi az*  aziaz*] ax! az? oz* ' 


—— = Oi, Obm\ as dz! ax™ 
(2.18) | er = {Ta als (% + %e V,D im + viva} az az* 


Here, the fi,, T°, are functions of the 2‘, whilst the P'é,, P°$,, ¥; are functions 
of the x‘, 

The method explained above is not the only one for treating analytically a 
projective geometry of paths. Another method is given in §6. 


3. The gauge variable and the transformations of the representation. We 
start with a definitely choosen projective normal parameter 7. Corresponding to 
the transition from the projective normal parameter z to the affine parameter s, 
we introduce on every path a gauge variable x° by the substitution 


3. 0 = 

a : . 108 oe 

Let § be another affine parameter on the paths and 
1 dg 

0. A bord 

(3.2) z a log - 





'’ A different definition of the gauge variable has been given by Hlavaty, by means of a 
density of weight —1, respectively — 1/M(n + 1) (in our notation). See V. Hlavaty, 
Connexion projective et déplacement projectif. Annali di Mat. (4) 12 (1933-4), pp. 217-94, 
especially pp. 224-5; Systeme complet des invariants d’une courbe dans un espace projectif 
courbe. Abh. a. d. Seminar fiir Vektor- und Tensoranalysis II.-III., Moskau und Lenin- 
grad 1935, pp. 119-150, especially p. 121. 
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the gauge variable corresponding to the transition  — §. Then it follows from 
(2.8) that for any path 


(3.3) path I * yl) dat + C, 


where the integral is taken from an arbitrary but fixed point q on the path along 
the path and where C signifies an arbitrary constant. Generally y;(x)dz' is 
not an exact differential. For y,(z) = 0, the equation (3.3) contains also the 
special case that § is nothing but another determination as + b (a, b constants, 
a ~ 0) of the same affine parameter; in this case C = —(log a)/2M. 

To the different determinations of the same affine parameter therefore corre- 
sponds the continuous group #° = x° + ¢. 

Let us now introduce the parameter 7 as independent variable in the differ- 
ential equations (2.4) and (2.11). Since 


(ou) = (2) tayo, 


5 da’ dst 

‘dn dr’ 

On replacing in (3.4) and (2.4) the derivations ds/dz, d*s/dx*, d*s/dx* by r°, 
dx°/dr, d?x°/dx* aceording to (3.1), we find that the equations (3.4) and (2.4) 
can be written respectively as 


we have 


(3.4) (s, r} = 2MT 


d?x° dx®\?* » dx dx* 
(3.5) 2+ (=) +f ns 2 = 0, 
d? x dx® dx‘ ; dx dx 
3.6 putt ni st , eo ae 
3-6) dx? +S dr el dr ° 
Finally, we adopt the convention that Latin indices will take on values from 
1, ---., and Greek indices values from 0, 1, --- , n, and define the functions 
By by 
(3.7) (a) IS, = TS, 


(b) Igo = Wo, = M65. 
Then the equations (3.6) take the form 


d? x a Axe dxy _ 
a saa” 
Obviously it is also possible to make the steps which led from (2.4) and (2.11) 
to (3.8) in the inverse order. 
Hence, a projective geometry of the paths (2.1) can be regarded as the theory of the 


(3.8) 


3 
> 
; 
t 
% 
2 
¢ 
4 
j 
: 
L’ 








{ | uf 


un 


He 


Th 
on 
sys 
tio 


fol 

































2) skid Re HS ta 


ee eeey ee 








ON PROJECTIVE GEOMETRY OF PATHS S85 


differential equations (3.8), with the additional conditions all3,/ax° = 0 and (3.7b), 
under the transformations 


d# = dx® + zo Vix) dz, 
(3.9) 





‘a = a(Z', #2, --- , #) 

P= Mz', z, ---, 2" Rol at 0 
( ( ’ ’ ’ ); (x2 x, — a) a ) 
where the W(x) are arbitrary functions of x', x7, --- , x”. 

If the x* are interpreted as coérdinates in an (n + 1)-dimensional manifold 
+9? the equations (3.9) arise then and only then from a transformation of coér- 
dinates in *9t when y,(x)dz‘ is the exact differential of a function of zx', x’, --- x”. 
For 


(3.10) vi = 





o 2108 e 


axi ’ (p - p(z', 2, hie » 2”) 


we have from (3.9) 
(#9 = x + log p(z', a x"), 


(3.11) Pea 
|e F(z, #, Tr , 2°), (ss: » &) #0), 


a(z', 2, iain x”) 





Here p(z!, 2’, --- , 2") is an arbitrary function of its arguments. 

The transformations (3.11) are Veblen’s transformations of the representation. 
Thus, to restrict one’s-self to the transformations of the representation means, that 
one does not consider the total class K of all affine connections belonging to the 
system (2.1) of paths, but restricts one’s-self to the subclass K, of the affine connec- 
tions arising from an arbitrary but fixed affine connection T of the class K by the 
following special projective changes of the affine connection: 

0 log p 5: - M 2108 e 5: 


rie =7t,—M 
” ” ax* 


(3.12) dx’ 
(p arbitrary function of z', 2, ---, 2"). 
Otherwise expressed: We obtain the transformations of the representation by 
considering the subclass K; as representative for the whole class K. 
It is easy to characterize the subclass Ki. Let 


; rr, 
(3.13) a “— ins zh 


be the curvature tensor of the affine connection T. Then the subclass Ki con- 
sists of all affine connections of the class K which possess the same contracted 
curvature tensor 

(3.14) in = By — Bu, (By; = Bij) 
as the affine connection I.'* 








i m i m 
+ Ta jk” 4*Amk+ jl 





'8 See L. P. Eisenhart, Non-Riemannian Geometry, New York, American Mathematical 
Society, 1927, VIII + 184 pp., especially p. 91. This book will be referred to as N. R. G. 
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The 11%, are transformed by the transformations of the representation accord- 
ing to 


- Ox" Ox" O72" \ axe 
3.15 ‘, = (| 02.— — comet: Dennen 
(3.15) By ( °T 9x® AxT az =) Ox 


4. The curvature tensor of a projective geometry of paths. We shall now 
derive a tensor in 9 which is invariant under arbitrary projective changes of 
the affine connection T and represents the natural generalization of the Weyl 
projective curvature tensor. 

For that purpose, we consider the II, , defined by (3.7). According to (3.15), 
the Ij, are transformed, by the transformations of the representation (3.11), like 
the components of an affine connection II in the (n + 1)-dimensional manifold 
oe *M (§3). The components R%,, of the curvature tensor of the connection II 

vanish identically save R$, and! 


fe | (4.1) Ri, = Biy + MY$,6; — MYT%,8}. 
; In (4.1), Bi, is the curvature tensor (3.13) of the affine connection I in M. 


| The R‘,, are the components of a tensor in Mt. By a projective change (2.5) 
RP at of the affine connection I the tensor R;,, transforms into 


) 3 u (4.2) Rt. = Rin “+ Win 8) = 019) -m Vind§ 
th where 
(4.3) Ya = 4 


Equation (4.2) can be derived with the aid of (4.1), (2.16) and 2° 

fai i Oy; r i OY; r i i 
(4.4) Bix, si By it + (* 7 VT je + vatn)st ( ~t, jl > vit) ai Wind j- 
From (4.2), it follows that the contracted tensor 
(4.5) Rix = Riu = Dix + M(n = 1) io 
is transformed into 


' 1 
(4.6) Ry = Ry + ar Vik» 


by a projective change of the affine connection Tf. By introducing the value 
Vir = 3(Vjx — Yxj) from (4.6) in (4.2), we see that the tensor 

















a 1 Rye — Ri pf 4 Ll Ra — Ry 5 
j ik l 
n+l 2 n+1 2 
ong (4.7) " 
ei Peay 1 i 
i i fe + esi (Ra — Ru) 5; 
a 19 See O. Veblen, I.c., Journ. London Math. Soc., p. 157; P. R., p. 20. 


| 20 See L. P. Eisenhart, N. R. G., p. 87. Eisenhart’s notations differ slightly from ours. 
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remains unaltered by arbitrary projective changes of the affine connection I. 

If we replace the Rj, ,; and Rj, in (4.7) by their values (4.1), (4.5) we get finally 
1 Bu a Bi 

n+1 2 








‘Bin = Bins — ( - Mr',) 





(4.8) 








1 Bu — By F 1 é 
+(—4, oe # — Mrs,) a + 1 (Bu — Bu) 8. 





We call the tensor with the components *B‘, , the curvature tensor of the projective 
geometry of paths under consideration. It has the following properties 


(a) *Bi et = 0, 
(4.9) (b) *Biir + *By i, = 0, 
(c) * jer + *Bir; + *Bij, = 0. 


In §7 we come back to it once more. 













5. The preferred projective normal parameter. From (4.9, a, c), we see 5 
that the contracted curvature tensor *By, = *B},; of a projective geometry of vB 
paths is symmetric. From (4.7) and (4.8), it follows that } ren 


(5.1) *Ba = 4(Ryu + Re) = 3(Bu + Buy) + M (n — IT S,. 
By the equation 
(5.2) *By = 0 
a preferred projective normal parameter is defined (up to linear fractional trans- 
formations) which is already fully determined by the class K of affine connec- 
tions belonging to the system (2.1) of paths. From (5.1), (5.2), we have 
nh 1 Bi + Bij 
M(n — 1) -; 
and by substitution of this value in (2.11) we get the differential equation 


2 dxi da* 
(5.4) {x, s} oo Se 


which defines the preferred projective normal parameter. 

The class K of affine connections belonging to a system (2.1) of paths together 
with the preferred projective parameter determines a special projective geometry 
of paths. The curvature tensor of this special projective geometry of paths is 
the Weyl projective curvature tensor :! j 


1 
n—1 




















Bee ~~ 


aa 
a 








(5.3) rs , = 
















Wier = Beet + [(nBu + By) 5} — (nBu + Bis) 61] 









(5.5) 





$s Gy — a. 
n+1 








1H. Weyl, l.c., p. 101. 
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Indeed the equation (4.8) changes into - 5), by introduction of the value (5.3) 
for T'j,- Equation (5.2) gives Wi. = W5.; = 9. 

The curvature tensor *B‘,, of an arbitrary projective geometry of paths and 
the Weyl projective curvature tensor are connected by 





(5.6) San = *Bhay — Oat + at, 
as follows from (4.8), (5.1), (5.5). Thus, the equation 
(5.7) *Bie, = 0 

is equivalent to the two equations 

(5.8) | (.=0, *Be=0. 


6. The method of T. Y. Thomas. A second way of dealing with a projective 
geometry of paths results from a slight generalization of the method used by 
T. Y. Thomas in the research of projectively connected manifolds.” 

Let us call every system of functions I’ ;% (2), resulting in a fixed codrdinate 
system (z*) from (2.5) by a definite choice of the functions y;, a connection of the 
class K in the codrdinate system (x*), independently of the manner in which the 
v; and consequently the I’, are transformed by transformations of coérdinates. 
Under these connections theve exists a single normal connection characterised® 
by T;; = 0. We denote this connection by II and its components by IT} ,,”4 
so that we have 


(6.1) ny, = 0. 
A simple calculation gives 

t t 1 m t 1 m ry 
(6.2) jk = Te — —a Ua; 8k — —a Tnx 55. 


nm+1 "© n+1 
Consequently II corresponds to the choice 


1 ym 
(6.3) vi = re ss i 


of the y; in (2.5). The corresponding parameter § is the projective parameter p 
of T. Y. Thomas. According to (2.8) and (6.3) we have 


(6.4) Dp ae Se 
ds 


22 T. Y. Thomas, l.c. See particularly the exposition of this method in D. I., Chapter 
I.-III., and the discussion of J. H. C. Whitehead, l.c., §10. 

*3 See E. Cartan, Sur les variétés d connexion projective. Bull. Soc. Math. France 52 
(1924), pp. 205-41; J. M. Thomas, Asymmetric displacement of a vector. Trans. Amer. 
Math. Soc. 28 (1926), pp. 658-70, especially p. 664. Cf. also L. P. Eisenhart, N. R. G., 
p. 104. 

* The 1g , of this and the following paragraphs have nothing to do with the mt , of §3. 
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With the parameter p as independant variable the differential equations (2.1) 
of the paths take the form 
d?z* ‘ dzi dz* 
a II‘. =—_—_—_—_—- = R 
dp’ i dp dp ° 

The projective normal parameter x can now be defined by the differential equa- 
tion : 


(6.5) 


dx’ dx* 
(6.6) {r, p} _ 32MM, 
up to linear fractional transformations. In (6.6), the II}, are point functions 
symmetric in the lower indices: 


(6.7) I, = IS ;- 


The connection between the IT}, and the T'},, I$, (§2) is given by (2.16) if 
we substitute the II, for the f'}, and the values (6.3) for the y;. Thus, we get 


0 1 1/far; , or 2 
the = The — gee ey (ae tas) ~The 
St yo 
+ n+1 lr; ; =} 
By this method, the class K of the affine connections belonging to the system 
(2.1) of paths is represented in each coérdinate system by a single connection, 
the normal connection II. But, the condition fj; = 0 being invariant only 
under transformations of coérdinates with constant jacobian and not under 
arbitrary transformations of coérdinates, the II}, are not transformed by such 
transformations like the components of an affine connection, the projective 
parameter p is not invariant under transformations of codrdinates and the func- 
tions II}, are not the components of a covariant tensor of rank 2. The formulas 
of transformation for the IIj,, Il,, p by transformations of codrdinates are 
rather 





(6.8) 














=» » Ox™ Ox" ax” \ az! 1 @log (xz) ., 
Tl 5% _ ne “ant oak <r @ oe Ge j 5). 
(6.9) az’ at* = aX «E*) Ox n+1 dz! 
_ 1 @ log (zz) 5! 
n+1 az 
i’. = 1° az! dx™ 1 a éx! ax” a?x* ) az" 0 log (xz) 
6.10) ik tm 95 age " M(n +1) \\ '™ ag! at © ax att) ax* = aa" 
_ 1. Glog (2%) d log (zz) _ & log | 
n+1 az az az’ az* |’ 
o 2 
(6.11) |  _ ¢ (xz) HH, (C = const.). 


dp 
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Here and in the following developments, (x), (x) are respectively the func- 
tional determinants of the transformation (2.9) of codrdinates and of the inverse 
transformation. 

Here, the transition to the variable x° takes place by means of the equation 


1 dp 
corresponding to (3.1). The variable x° corresponding to another determina- 
tion ap + b(a, b constants, a ~ 0) of the projective parameter p is x° — 
(log a)/2M. By repeating the calculus of §3 with p, I1},, Il}, instead of s, 
r§,., I’, respectively and by introducing the convention of §3 on the Latin 
and Greek indices, we can put the system of the differential equations (6.5), (6.6) 
in the form 


(6.12) = 


a Gti dzy 


d2x7% . 
ri oe = 


Here the *I'§, are defined by 
“ye, = [1% 
7k jk) 


*T3, = *Te, = M oe. 


(6.13) 


(6.14) 


Since the parameter p is not invariant under transformations of codrdinates 
with variable jacobian, the variable x° corresponding to the transition  — p 
underlies not only the transformations #° = x° + const., but the transformations 


(6.15) P = 7 log (4x) + const., 


1 
~ Matt 


which follow from (6.11) and (6.12). Corresponding to this, the transformations 
(3.9) are here replaced by the transformations 


2? 


z antes Ss ale th 
(6.16) [2 M(n 4 1) log (zx) + const., 


G = Zi(z!, 22, ... , 2). ((4x) ¥ 0). 
From (6.13) and from the invariance of the parameter 7 under transformations 


of coérdinates, it follows that the *I'§, are transformed by the transformations 
(6.16) according to 


= dx" Ox" ax \ axe 
6.17 "Ts, = (TS, —| — + ——) —. 
—_ ” ( "og ogy | Oa cs) dae 
The theory of T. Y. Thomas results if we take M = — 1/(n + 1) and determine 


the functions II}, from the connection II by the equation 


(6.18) nm, ="tle 


.= 
: es Be. 
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corresponding to (5.3). Thereby, B5,, is the equiprojective curvature tensor 
that is the curvature tensor of the connection II, and ; 


’ 


1 all; i m 
(6.19) Bin = Sin = —- — Ta Tj; 
so that 
(6.20) Bir saad Bi; . 





7. The curvature tensor of the connection *I’. The equation (6.17) expresses 
that the functions *Ig,, defined by (6.14) behave, under the transformations 
(6.16), like the components of an affine connection *T in the (n + 1)-dimensional 
manifold *9 with coérdinates z*. The components *B3,, of the curvature 
tensor of this connection *I vanish identically, with the exception of *B°,,, 
*B‘,,. These components are 


(7.1) *Bi in = Bin + M11} ,.5; — MII},6;, 


(7.2) *Biu = amis _. 
. 7 








II}, 0 0 
ax! act + I}, ng = on Think ° 


From (6.20), we see that the equation (7.1) results if we substitute in (4.8), 
for the T'$,, I'j,, the I1$,, Ii, (§6) respectively. The *B},, are, therefore, the 
components of the curvature tensor of the projective geometry of paths under con- 
sideration, introduced in §4.75 


The *B5,, transform by the transformations (6.16) according to 


=) ax? aa’ axt 


ar 0 é 
(7.3) *Bo an _ (+83, + "Bert oxi az’ az* az! 


Hence, they are then and only then the components of a tensor in J when 
(7.4) *Biy = 0. 
For n > 2 we have 


. 1 - 1 i 
(7.5) M*BS, = San lia Bou — —- (*Biai — *Brar + *Baa), 


where the comma denotes the covariant derivative with respect to the I'},. 
In this case, we see that (7.4) has the consequence *B,, = 0. For n = 2, we get 


M*B xy sia 3(T; )*By = rj, Py) + *Biet — *Biak 


(7.6) ee a eee 
#(Bix.1 ‘t,k) 3\P x1 Uj,k/* 





** This method goes back to T. Y. Thomas, I.c., Math. Zeitschr., p. 732. Cf. also O. 
Veblen, Projective tensors and connections. Proc. N. A. 8. 14 (1928), pp. 154-166, especially 
§8.—(7.2) corresponds to the expression given: by J. M. Thomas, for the Weyl projective 
curvature tensor; see J. M. Thomas, Note on the projective geometry of paths. Proc. N. A. 8. 
11 (1925), pp. 207-9. 
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(7.6) follows, for n = 2, from the general formula 


* RO 1 


, ri * m * P 
oa gg Oe + gt ee 
— UB — Bux) — (Biya — Byrd}, 


from which also (7.5) can be derived by means of 


i - 
Tklt 


_ Bix) + nB.1 = Bix)} 





(7.8) ; 
+ pes (*Bu a a *Bi 4). 


From known theorems on the projectively flat spaces?’ and from the remark 
at the end of §5, it follows finally, with the aid of (7.5) and (7.6): 

For n > 2, the vanishing of the curvature tensor of a projective geometry of paths 
is a necessary and sufficient condition that the class K of affine connections belonging 
to the system of paths contain an euclidean-affine connection and that the affine 
parameter on the paths corresponding to this connection be at the same time a projec- 
tive normal parameter. For n = 2, the condition (7.4) must be replaced by 


(7.9) * ia = 0, * Bias = 0. 


8. The equivalence problem. We call two projective geometries of paths 
in an n-dimensional manifold IN equivalent if they have the same paths and the 
same projective normal parameter on the paths (up to linear fractional trans- 
formations). The problem: to decide if two given projective geometries of 
paths are equivalent or not, is the equivalence problem. It is solved if we can 
give necessary and sufficient conditions for the equivalence of two projective 
geometries of paths. Since this equivalence problem is not essentially different 
from the equivalence problem for two projectively connected manifolds,?* we 
can be brief. 

Let two projective geometries of paths be given by functions I'},(z), rs .(2) 
and ie (2), P'?,(2) respectively, in the manner of §2. Let *I3,(x) be the func- 
tions defined by (6.14), (6.2), (6. 8) and “Ts (x) the Se formed in the 
same manner by means of the T* ‘ (2), T j 0 (4), Then we find without difficulty 
from §6, (2.17), (2.18): A necessary and sufficient condition for the equivalence 
of the two projective geometries of paths is that there exist, in the (n + 1)-dimensional 
manifold *IN with coérdinates x« (§6), a transformation of the form (6.16) such 
that the *I'§,,(x) and the *1'3,(Z) are connected by (6.17). 


*6 For the special case M = —1/(n + 1), II .* [(n + 1)/(n — 1] By see T. Y. Thomas, 
D. I., p. 58, formulas (18.5)-(18.7). 

37 H. Weyl, l.c., pp. 104-5. 

28 Q. Veblen and J. M. Thomas, Projective invariants of affine geometry of paths. Ann. of 
Math. (2) 27 (1926), pp. 279-96; T. Y. Thomas, l.c., Math. Zeitschr.; J. H. C. Whitehead, 
lie. Cf. also T. Y. Thomas, D. I., §§16-19, 88. 
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Now J. H. C. Whitehead proved the following theorem:® Let, in the (n + 1)- 
dimensional manifold *IN, the connections *T and *T, with components *r3, and 
*T3,, respectively, be affinely equivalent, that is, let there exist in *M a transformation 
of coérdinates 


(8.1) z= (2, zi, ‘Ceili Tn 







with not-vanishing jacobian such that the *1'3, and *T$,, are connected by (6.17). 
Then there always exists at least one transformation of the particular form (6.16) 
possessing the same property. 

With the aid of this theorem we have finally: 

A necessary and sufficient condition for the equivalence of two projective geometries 
of paths in an n-dimensional manifold M, given by T5,(x), TY,(x) and r',(2), 
I'S.(@) respectively, is that, in the (n + 1)-dimensional manifold *§ with co- 
ordinates x, the connections *I', *T defined by (6.2), (6.8), (6.14) and the corre- 
sponding equations be affinely equivalent. 

By this theorem the equivalence problem for two projective geometries of ii} 
paths in an n-dimensional manifold Yt is reduced to the affine equivalence 
problem for two particular affine connections in an (n + 1)-dimensional mani- 
fold.%° 




















9. General projective geometry of paths. A general affine connection in an 
n-dimensional manifold M is a geometric object ! whose components I'},(z, 2) 4 
in a determined coérdinate system (x‘) are functions of the 2n variables 
x‘, z‘, homogeneous of the degree zero in the <‘, and are transformed by the 
transformations (2.9) of codrdinates according to (2.10) if the #‘ are transformed 
like the components of a contravariant vector. We consider only symmetric 
general affine connections, that is, such that (2.2) is valid. 

By a path in the general sense we understand an integral curve x‘ = z‘(t) of a 
system of differential equations of the form 


(9.1) a (Ge + ar (=, %)) “(5 + 2r 2,5 ‘ 


where the functions I'(z, #) are homogeneous of the second degree in the 2*. 


By the substitution 
as @at | det 
ae ot (=, @) 


ds dx’ 


dt 
the equations (9.1) take the form 
ax! if, az\ _ a 


*® J. H. C. Whitehead, l.c., §8. a || 
0 For this problem see e.g. T. Y. Thomas, D. I. §§81-85. 
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s is called an affine parameter of the system (9.1) of paths. It is determined a 
fe up to entire linear transformations. or 
a Every affine connection I such that (0 
(9.4) Ti,(x, z)¢iz* = 2T(2, 2) tl 
has not only the paths (9.1) for autoparallel curves but corresponds moreover (¢ 
to the same affine parameter s.** Under these connections I’, the connection 
with the components T'j, = 6°I'‘/aé‘a¢* is preferred. b 
The equations (9.1) do not change if we put, in the place of the functions I‘, 
the new functions (( 
(9.5) T(x, é) = (a, 4) — V(a, 2) a, : 
i where ¥(x, #) is an arbitrary function homogeneous of the first degree in the z‘. — fc 
f To the functions I corresponds the form t] 
art d?xt , dz = 
: f (9.6) Te + 2r% (=, =) 0 
t 
i of the differential equation of the paths, with the new affine parameter §. Along ; a 
ah any path, the connection between s and § is given by 
‘eo } 
Lt 
i Le (9.7) es e-2 J v(z,dz) | 
hike ds V 
a The class K of affine connections belonging to the system (9.1) of paths consists ( 
of all symmetric affine connections T whose components f(z, &) satisfy the con- 
ditions ( 
(9.8) fi aia* = Ti aie — Quai. ) ] 
In (9.8), the I'5, are the components of an arbitrary symmetric general affine ; 


connection IT entiating (9.4) and y has the meaning stated above. With 
transformations of coérdinates, y is a scalar. 
A projective normal parameter x on the paths (9.1) can be defined by 


scoala AN 
~~ 


( 
(9.9) {r,s} = —4MTr° (=, *), 
ds 


up to linear fractional transformations. In (9.9) M is a given constant different 
from zero and I'°(z, #) a given function homogeneous of the second degree in the 





31 An example of two general symmetric affine connections related in this way is given | 
by the two connections with the components G _and a -in E. Cartan, Les espaces de Fins- 7 
ler, Actualités scientifiques et industrielles 79, ‘Hermann et Cie, Paris 1934, 42 pp., espe- 
cially pp. 18-9. : 

' % For a general geometry of paths, it would be preferable to take as the starting point i 
ee as of the theory not the components .. of the connections satisfying (9.4) but the functions I“ q 


themselves, considered as the components of a geometric object in the coérdinate system § 


(x*). 
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¢. We ask that the parameter 7 shall not alter by transformations of co- 
ordinates and by projective changes of the affine connection I according to (9.8) 
(or, what amounts to the same, by transition to another affine parameter § on 
the paths). This gives for T° the law of transformation 


(9.10) T(z, #) = T(x, 3), (i — 3 i) 
by the transformations (2.9) of codrdinates and 


i oe Se pM. . 
(9.11) r= ay (he aan +¥) 
by a projective change of the affine connection I according to (9.8). (9.11) 
follows by means of (2.15). In the functions on both sides of the equation (9.11) 
the arguments are 2‘, <'. 

We now introduce the gauge variable x° by means of (3.1), take the parameter 
7 as independent variable in (9.3) and (9.9) and adopt the convention of §3 on 
the Latin and Greek indices. Then we can comprehend the equations (9.3) 
and (9.9) in the form 


(9.12) | i + 2P« («, ) = 0, 
where the P*(z, z) are defined by 

(9.13) Px, z) = $M(z°)? + P%z, 2), 
(9.14) Pi(z, z) = Mx x + T(z, z). 


In an analogous manner as in §3, we find here: 

A projective geometry of the paths (9.1) can be considered as the theory of the 
differential equations (9.12) where the P*« are given by (9.13), (9.14), under the 
transformations 


dg = da! + + ¥(c, da), 


(9.15) ((Zx) # 0). 


= F(z", 2’, dia z*), 
In (9.15), Y(x, dx) ts an arbitrary function of the x‘, dx‘, homogeneous of the first 
degree in the dz‘. 
The equations (9.15) arise then and only then from a transformation of co- 
ordinates in the (n + 1)-dimensional manifold *9t (§3) when we restrict our- 
selves to the changes 


(9.16) r= -—- yw—_— 





of the functions I‘, where p is an arbitrary function of z', z*, --- , 2” alone. 
This signifies that we consider only the subclass K, of the class K of affine con- 
nections belonging to the system (9.1) of paths which consists of all affine connec- 
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tions of the class K connected with an arbitrary but cefinitely choosen affine 
connection I of this class by 

(9.17) li ,iet = Pi,aic’ — 2M —— aie. 

Then the transformations (9.15) reduce to the transformations of the representa- 


tion (3.11). By a transformation of the representation, the P* transform 
according to 








( ss. * = — ( Tr U o- £8 & ) Ox 
| Pn(2, 2) = Pr(z, #) + 5 ea 2 ral — 
(9.18) { 


| ( az ) 
rv = — zr}. 
( ax* 


The transform, consequently, like the components of a general affine 





dx* axF 
connection in the manifold *2. 
In the general projective geometry of paths, there exists also a preferred 
projective normal parameter which is the generalization of that introduced in §5. 
For it can be shown without difficulty that the scalar* 


i 2pi 27% 
_ 9 rT rT 


Ox' Ox' ox” Ox ax”™ 


19 . 
—_ - aa” aa 














transforms into 


(9.20) 2B = 28 + (n—1)(% #— 2% r+ ys) 
ox" ax! 


by the changes (9.5) of the functions I“. 
scalar 


(9.21) 


Hence we see from (9.11) that the 


*B=B+M(n— Dr 


which is homogeneous of the second degree in the <‘ remains unaltered by the 
transformations (9.5). The postulate *B = 0 gives, therefore, 


1 


rm = 


If we introduce this value of T° in (9.9) this equation defines the preferred projec- 
tive normal parameter. up to linear fractional transformations. 





33 In the case of §5, we have 
2B = By riz, 2*B = *B,, ciz*. 


These equations remain valid for a general geometry of paths if we put, for the By, *Bjx, 
the values (9.25), (9.27) respectively. 
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In the case of the general projective geometry of paths, the Weyl projective 
curvature tensor has the components 





F 1 
Wher = Byer + S— lmBa + By) di — (mB + Bui) 81) 

















1 
+ er | (Bu — Bu) 3; 
(9.23) . 1 , 
+¥—, lan (Bip — By) 6, — a7 (Bio — Bou) 8; 
1 . 
oF 537 (Bu — Bu) is, 





where 











jk = ( te. ee, 2c ( ar ar = 
(9.24) — aa at" Ox sk OR" OE™ Aa! dz at Ox* aa ad ak” az* | i 
1 

+ er ern a ere germ ! 
ax” aa! aa ac* = A&™ A* AX? Az! 















and 
























(9.25) Bu = Bini 
tf 
The curvature tensor of the projective geometry of paths under consideration is ‘ b, 
j ' , *B. *Bo .. ati 
} (9.26) "Bhar = Wher + —* 3) — ——* tk, | 







where 





a*B 
* —,—— 
(9.27) Bs = Soy 





From (9.26) and (9.23), it follows that 
(9.28) *By = *BS,;- 


Besides the tensor *B‘,,, we have in the general projective geometry of paths 
| still the tensor* 


i F 38 i 1 or™ °¢ 
(9.29) Mijes - ware (F zie) 


and the tensor 













a*B a Bu 
an at oa! =a! 





(9.30) *Bi = 













4 J. Douglas, |.c., p. 157, equation (5.11). 
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which are of importance. The equation 
(9.31) Ii,,; =0 


is a necessary and sufficient condition that a general projective geometry of 
paths have the same paths as a restricted projective geometry of paths,* the 
equations 


(9.32) 1 = 0, *Bixt = 0 


are necessary and sufficient conditions that it be itself a restricted projective 
geometry of paths.** Therefore (§7), the equations 


(9.33) Ili ey = 0, *Bo a: = 0, 


from the second of which *By,. = 0 is a consequence, are, for n > 2, necessary 
and sufficient conditions that the class K of affine connections belonging to the 
system (9.1) of paths contain an euclidean-affine connection and that the affine 
parameter on the paths corresponding to this connection be a projective normal 
parameter. 

For sake of brevity, we do not enter into the generalization, to the general 
geometry of paths, of the method set forth in §6, a generalization which offers 
no difficulty. 


PRAGUE (CZECHOSLOVAKIA), 
GERMAN UNIVERSITY. 





% J, Douglas, l.c., p. 157. 


*° For the proof of this theorem, the generalization of the method of §6 would be neces- 
sary. 
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Introduction 









Considérons un systéme dynamique, dont le mouvement est donné par les 
équations différentielles: 






Gi we Xe (ti, +++ 5) (¢=1,2,.--,n) 
ol 11, --- ,%n sont les paramétres qui déterminent la position du systéme et od ; 
t désigne le temps. i 
Nous considérons l’ensemble des valeurs 2, --- ,2, comme un point p de 
coordonnées (2%, ---,2n) dans l’espace Euclidien EH" & n dimensions. Les ' 
mouvements des systémes dynamiques seront caractérisés par la position des 1 e 
points sur les trajectoires décrites par ces points p C E*: nous désignerons la i 
position du point p sur la trajectoire 4 |’instant ¢ par f(p, t); f(p, 0) = p. 
Supposons que les fonctions X; dans les seconds membres des équations \ 
différentielles soient continues par rapport 4 tous les arguments et qu’elles satis- * I 
fassent & la condition de Lipschitz dans un certain domaine fermé D. Ce 4 | 
n’est que sous certaines restrictions que nous aurons le théoréme de |’existence 
et de l’unicité des solutions ainsi que le théoréme de la dépendance continue des | Lb 
trajectoires des conditions initiales. ht 
Ce dernier théoréme peut étre énoncé de la maniére suivante: Soit p un point 
fixe et g un point arbitraire de l’espace E*; alors pour tout « > 0 et pour tout 
nombre positif 7’ arbitrairement grand, on peut indiquer un nombre 6 > 0 assez 
petit pour qu’on ait p(f(p, t), f(q, #)) < € pour toute valeur de ¢, appartenant 4 
lintervalle 0 < ¢ S T lorsque p(p, q) < 4; p(x, y) désignant la distance entre les 
points x et y dans E*, 
Nous considérons dans la suite les ensembles invariants, définis de la maniére 
suivante: l’ensemble M C E* est dit invariant si f(p, t) C M, pour tout p C M. 
Dans la dynamique générale une place de grande importance revient 4 cer- 
tains mouvements qui représentent une généralisation des mouvements péri- 
odiques. Ce sont les mouvements stables au sens de Poisson et comme cas 
particulier, les mouvements récurrents de Birkhoff. La trajectoire f(p, t) (p 
_  ¢tant fixé) est dite stable au sens de Poisson si pour tout nombre positif ¢ et pour 
| tout nombre positif 7 arbitrairement grand il existe des valeurs de ¢—tant 
positives que négatives et surpassant 7’ en valeur absolue, pour lesquelles on a: ‘ 


p(p, f(p,t)) < « 
899 
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Un mouvement stable au sens de Poisson est dit récurrent si A tout € > 0 il cor- 
respond un nombre positif constant rt = r(e) tel que dans l’intervalle de temps 
de longueur 7 tout point du mouvement f(p, ¢) passe 4 une distance inférieure 
d e de tout point de la trajectoire du mouvement considéré. 

Les mouvements récurrents ont été introduits et étudiés par Birkhoff. Les 
principaux résultats de ses études peuvent étre résumés comme il suit :! 

Un ensemble invariant, compact et fermé = est dit minimal, s’il ne peut con- 
tenir un sous-ensemble différant de = et jouissant des mémes propriétés. Birk- 
hoff a démontré que tout ensemble invariant, compact et fermé renferme au 
moins un ensemble minimal. De plus les recherches de Birkhoff ont établi 
que les ensembles minimaux sont intimement liés aux mouvements récurrents: 
tous les mouvements d’un ensemble minimal sont récurrents et tout mouve- 
ment récurrent est contenu dans un ensemble minimal. La trajectoire de tout 
mouvement récurrent est dense partout dans l’ensemble minimal qui contient 
le mouvement. 

Dans cet article j’introduis une notion nouvelle. Je désigne un ensemble 
invariant compact et fermé comme un ensemble dynamiquement indécomposable, 
s’il ne peut étre représenté comme une somme de deux ensembles qui soient 
différents de l’ensemble donné et qui jouissent des mémes propriétés, c’est 4 dire 
qui soient invariants et fermés. Les ensembles dynamiquement indécompos- 
ables présentent une généralisation de la notion des ensembles minimaux, ceux-ci 
étant dynamiquement indécomposables. Les ensembles dynamiquement indé- 
composables forment deux classes distinctes, celle des ensembles indécomposables 
ordinaires et celle des ensembles indécomposables exceptionnels. Je nomme en- 
suite les ensembles ordinaires indécomposables—ensembles quasi-minimaux, 
car ce sont précisement ceux qui représentent une telle généralisation des ensem- 
bles minimaux qui permettent d’inclure les mouvements stables au sens de 
Poisson. 

Le réle des ensembles quasi-minimaux dans la dynamique est caractérisé 
par les théses suivantes qui forment le résultat principal de ce mémoire: tout 
ensemble stable au sens de Poisson appartient d un ensemble quasi-minimal, la 
trajectoire d’un tel mouvement élant partout dense dans cet ensemble, et tout mouve- 
ment quasi-minimal contient un mouvement stable au sens de Poisson, dont la 
trajectoire est dense partout dans cet ensemble. 


1 


Nous dirons que |’ensemble invariant, compact et fermé © des points du sys- 
téme dynamique est dynamiquement indécomposable, s’il ne peut étre représenté 
comme la somme de deux ensembles différents de © qui jouissent des mémes 
propriétés, c’est 4 dire qui soient fermés et invariants. 

THEOREME I—Soit 0 un ensemble dynamiquement,décomposable et G un domaine 
relatif arbitraire de cet ensemble; alors ensemble des points appartenant aux tra- 
jectoires qui traversent le domaine G est partout dense dans ®. 





1 Birkhoff: Dynamical systems. N.Y. 1927 Ch. VII. 
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Admettons que le théoréme ne soit pas vrai. Alors on pourrait indiquer dans 
l'ensemble © deux domaines relatifs G et G’ tels qu’aucune trajectoire traversant 
le domaine G ne croise le domaine G’. Considérons l’ensemble des points, 
appartenant aux trajectoires qui traversent le domaine G et désignons par 0; 
la fermeture de cet ensemble. L’ensemble ©; obtenu sera fermé, de plus, il est 
invariant, comme la fermeture d’un ensemble invariant. 

Il est évident que ©; 9, car aucun point du domaine G’ C © n’appartient 
i l’ensemble 0, . 

Considérons ensuite l’ensemble 6 — 0, et soit @2 la fermeture de cet ensemble. 
Il est évident que 6: # © car aucun point du domaine G C © n’appartient aA 
l’ensemble Qs . 
Nous avons done 


6 = 6 + ®, 


0, et GO, étant fermés et distincts de 0. 

Le résultat obtenu contrédit la supposition que l’ensemble 0 est dynamique- 
ment indécomposable, et le théoréme est démontré. 

THEOREME II—Tout ensemble dynamiquement indécomposable © contient un 
mouvement f(r, t) tel que sa trajectoire est partout dense dans cet ensemble. 

Soit G un domaine relatif arbitraire dans l’ensemble 6. Donnons nous un 
nombre arbitraire « > 0 et montrons qu’il existe un domaine G, C G tel que 
toute trajectoire croisant le domaine G; , passe 4 une distance inférieure A « 
de tout point de l’ensemble 0. 

Recouvrons l’ensemble 0 par un nombre fini de domaines, dont les diamétres 
ne surpassent pas « , ce qui est possible l’ensemble 6 étant compact. 

Soient G!, G2, .-- , G@" ces domaines, rangés dans un ordre arbitraire. 

Quel que soit le domaine G, il se trouvera toujours, en vertu du théoréme 
précédent, un point p C G tel que la trajectoire déterminée par ce point f(p, 1) 
croisera le domaine G'. Par conséquent on peut indiquer un moment du temps 
t; pour lequel f(p, ti) C G'. En vertu du théoréme de la dépendance continue 
des trajectoires du point initial on peut décrire autour du point p un voisinage 
g’ Cgsi petit que f(g’, 4) CG. 

Il est évident que toutes les trajectoires qui sortent du domaine G’ traversent 
le domaine G1. De méme on peut choisir dans le domaine G’ un point p’ tel 
que la trajectoire que ce point détermine, traverse le domaine G* et on peut 
indiquer un moment du temps é& pour lequel f(p’, 2) C G*. Décrivons autour 
du point p’ un voisinage g’”’ C g’ si petit que f(g”, &) Cg’. Toutes les trajec- 
toires qui sortent du domaine G” traverseront le domaine G’ et G’. 

En poursuivant ces raisonnements jusqu’A ce que tous les domaines 
G', G?, ... , GN, soient épuisés nous obtenons un systéme décroissant de do- 
maines: GD G’ DG’ D.-. DG. Nous le compléterons par le choix d’un 
domaine G; assez petit pour que sa fermeture G,C G) CG. Toute trajectoire 
croisant le domaine G,, passera 4 une distance inférieure 4 € de tout point de 
ensemble 0. 
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Soient 6, @&,---:,é€n,-°-: une suite de nombres positifs convergeant vers 
zéro. En appliquant le procédé décrit plus haut nous pouvons construire con- 
formément aux nombres de cette suite une suite de domaines: G;, Ge, --- , 
Gn, +--+ telles que: 1) la fermeture de chaque domaine est contenue dans les 
domaines précédants; 2) toute trajectoire traversant le domaine G;, passe 4 une 
distance inférieure 4 ¢, de tout point de l’ensemble 9. 

Une telle suite de domaines a une partie commune qui n’est pas vide. La 
trajectoire f(r, t) déterminée par tout point r appartenant 4 cette partie commune 
est partout dense dans 9, car elle est fait partie des faisceaux de trajectoires qui 
sortent de ces domaines G; . 


2 


Considérons un ensemble 6 dynamiquement indécomposable. Désignons 
par Q» l’ensemble des points des trajectoires des mouvements périodiques et des 
points de repos. Nous dirons que l’ensemble 0 est un ensemble indécomposable 
ordinaire si pour tout e > 0 le e-voisinage de chaque point p C © — ®p contient 
des points de 9 qui n’appartiennent pas 4 la trajectoire du mouvement f(p, ¢). 

Dans le cas ot on peut indiquer un tel point p C 6 — Op et un nombre positif 
€ assez petit pour que le e-voisinage du point p ne contienne que des points de 0 
appartenant a la trajectoire du mouvement f(p, ¢) nous dirons que l’ensemble 
6 est un ensemble exceptionnel et que la trajectoire f(p, t) est une trajectoire 
caractéristique pour cet ensemble exceptionnel. 

Considérons les exemples suivants, comme exemples simples d’ensembles 
exceptionnels. 

Exemple 1.—Examinons les mouvements des points d’un espace Euclidien 4 
deux dimensions E?, déterminé par les équations différentielles: 


a = —ylat + (1 - 
OY = aa? + (1 — y)} 


ou z et y sont des coordonnées cartésiennes du point. Les trajectoires de ce 
systéme dynamique seront constituées du point de repos 4 l’origine, d’un en- 
semble de circonférences ayant leur centre commun 4 |’origine et de rayons 
r ~ 1, et des trajectoires situées sur la circonférence 2? + y? = 1. Les point 
de cette derniére circonférence se partagent en deux ensembles appartenant a 
deux mouvements différents: l’un est le point de repos avec les coordonnées 
(0, 1) l’autre a pour trajectoire les autres points de la circonférence. 
L’ensemble formé par les deux trajectoires sera un ensemble indécomposable 
et exceptionnel, et la seconde trajectoire une trajectoire caractéristique. 
Exemple 2.—Supposons que les mouvements des points de l’espace E? sont 
déterminés par les équations différentielles: 
= —plnp 
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ov p et g sont les coordonnées polaires du point mobile et od a est une constante 


quelconque. 
Les équations des trajectoires seront: 


— gCe—e¢ 
p=e ’ 


ou C est la constante d’intégration. 

L’ensemble des trajectoires du systéme dynamique aura l’aspect suivant: 
4 l’origine des coordonnées se trouve un point de repos. La circonférence 
p = 1 sera la trajectoire du mouvement périodique. Les points situés dans la 
couronne 0 < p < 1 décrivent des spirales se rapprochant asymptotiquement 
de la circonférence p = 1 pour t > + ~ et de|’origine pour t > — , 

Examinons l’ensemble formé par les trois trajectoires suivantes: par le point 
de repos, par la circonférence et par la spirale décrite par un point queleonque 
situé 4 l’intérieur de cette circonférence. Cet ensemble sera un ensemble indé- 
composable et exceptionnel, la spirale étant sa trajectoire caractéristique. 

TutoriME III. Un ensemble indécomposable exceptionnel © ne contient qu’un 
seul mouvement tel que sa trajectoire est partout dense dans 0, cette trajectoire étant 
la trajectotre caractéristique. 

En effet, soit f(r, ) la trajectoire d’un mouvement caractéristique de l’ensem- 
ble 6. Décrivons autour du point r un voisinage du rayon ¢ assez petit pour que 
le e-voisinage du point r ne contienne que des points appartenant A la trajectoire 
f(r, 0). 

L’ensemble © contient immanquablement un mouvement dont la trajectoire 
est partout dense dans cet ensemble. En supposant que cette trajectoire différe 
de f(r, ¢), nous arrivons 4 la conclusion que le point r ne sera pas son point 
limite, ce qui est une contradiction. Nous aboutissons également 4 une contra- 
diction en admettant qu’il existe 4 part le mouvement f(r, ¢) une autre mouve- 
ment, dont la trajectoire est partout dense dans 8. Le théoréme est démontré. 

THtorREME IV. Aucune semi-trajectoire du mouvement caractéristique f(r, t) 
de ’ensemble exceptionnel dynamiquement indécomposable © ne peut étre partout 
dense dans cet ensemble. 

Admettons que l’une des semi-trajectoires du mouvement f(r, t), par exemple 
la semi-trajectoire positive, soit partout dense dans l’ensemble 8. Dans ce cas 
tous les points de la semi-trajectoire négative du mouvement f(r, t) seront des 
points limites w de sa trajectoire positive. Mais alors en vertu du théoréme 
de la dépendance continue des trajectoires des condition initiales, les points de 
toute la trajectoire du mouvement f(r, t) seront des points limites w de la trajec- 
toire positive de ce méme mouvement. 

D’un autre cété, la trajectoire f(r, t) étant une trajectoire caractéristique, 
on peut indiquer un e > 0 tel que le e-voisinage du point r ne contienne que des 
points appartenant & la trajectoire f(r, t). Construisons dans le point rl’élément 
d’un plan perpendiculaire a la trajectoire f(r, t). Décrivons dans ce plan autour 
du point r un vosinage sphérique fermé (par rapport 4 l’élément du plan) avec 
un rayon qui ne surpasse pas ¢ et qui en outre soit assez petit pour que ce voisin- 
age, avec les points de sa frontiére, appartienne a ]’élément considéré. 
Examinons l’ensemble dénombrable des points d’intersection de la semi- 
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trajectoire positive du mouvement f(r, t) avec le e-voisinage que nous venons de 
construire. En vertu de nos suppositions tout point de cet ensemble sera un 
point limit w du mouvement f(r, ¢) et par conséquent |’ensemble lui-méme sera 
un ensemble parfait. Mais alors il ne peut pas étre un ensemble dénombralle. 
Nous arrivons a une contradiction et le théoréme est démontré. 

THEOREME V. Soit 6 un ensemble ordinaire dynamiquement indécomposable 
et soit f(r, t) une trajectoire partout dense dans cet ensemble. Désignons par 0, 
et 0, Vensemble des points limites a resp. w de la trajectoire f(r, t); alors 8 = 
6. + 9,. 

Admettons que le théoréme ne soit pas vrai. Alors l’ensemble 6’ = 6 — 
(0. + 9.) n’est pas vide. On sait que les ensembles 0, et 9,,, et par conséquent 
leur somme sont des ensembles fermés. Cela étant, l’ensemble 0’ sera un 
ensemble ouvert relativement 4 8. Par conséquent il se trouvera un domaine 
G = 0’ qui ne renfermera pas des points des ensembles 0, et 9... 

L’ensemble © étant un ensemble ordinaire on peut indiquer un point p C G 
qui ne sera pas situé sur la trajectoire du mouvement f(r, ¢). Le point p n’appar- 
tiendra évidemment pas au points limites a ou w du mouvement f(r, #). Cela 
étant, on peut décrire autour du point p un voisinage G’ C G d’un diamétre 
assez petit pour qu’il ne contienne aucun point de la trajectoire du mouvement 
S(r, }). Mais ceci contredit la supposition que la trajectoire du mouvement 
f(r, t) est partout dense dans 6. Le théoréme est démontré. 

THEOREME VI. Soit 0 un ensemble ordinaire dynamiquement indécomposable 
et f(r, t) une trajectoire partout dense dans cet ensemble. Alors au moins l'une des 
semi-trajectoires de ce mouvement sera partout dense dans 0. 

Admettons de nouveau que ce théoréme ne soit vrai, c’est 4 dire qu’aucune 
des semi-trajectoires f(r, t) ne soit partout dense dans 8. Désignons respective- 
ment par 0, et 9, les ensembles des points limites a et w du mouvement f(r, t). 

Les ensembles 9, et 9, sont des ensembles fermés invariants qui, selon nos 
hypothéses, seront les parties vraies de l’ensemble 6. De plus en vertu du 
théoréme précédent 6 = 0, + @,. 

Les conclusions obtenues contredisent l’indécomposabilité de l’ensemble 0, 
et le théoréme est démontré. 


3 


Nous dirons qu’un ensemble invariant, compact et fermé est un ensemble 
quasi-minimal et nous le désignerons par le signe 2 si cet ensemble est dyna- 
miquement indécomposable ordinaire. 

THtorfME VII. Tout mouvement stable au sens de Poisson f(p, t) dont la 
trajectoire est compacte, appartient d un ensemble quasi-minimal Q, la trajectoire 
de ce mouvement étant partout dense dans Q. 

Désignons par © |’ensemble des points appartenant a la fermeture de la trajec- 
toire f(p, t). L’ensemble Q est invariant, compact et fermé; de plus la trajectoire 
du mouvement f(p, t) est partout dense dans 2. 

L’ensemble 2 est un ensemble quasi-minimal; en effet tout sous-ensemble 


a, Lenensnediel 


le las tones! eh ete 





in’ 
in 


tic 


se 
tic 


ste 
se 











Ts tenants 








ENSEMBLES QUASI-MINIMAUX 905 


invariant fermé contenant f(p, ¢) coincide necessairement avec 2, done Q est 
indécomposable. 

Il nous reste 4 montrer que l’ensemble 2 ne peut pas étre un ensemble excep- 
tionnel. En effet si cet ensemble était exceptionnel, alors en vertu du théoréme 
IV la trajectoire f(p, t) serait une trajectoire caractéristique et aucune de ses 
semi-trajectoires ne serait partout dense dans 2. Mais ceci contredit la défini- 
tion de la stabilité au sens de Poisson. Le théoréme est démontré. 

TutorbME VIII. Tout ensemble quasi-minimal Q contient un mouvement 
stable au sens de Poisson f(p, t), dont la irajectoire est partout dense dans cet en- 
semble. 

L’ensemble © étant indécomposable, il en découle |’existence d’un mouvement 
f(r, }), dont la trajectoire est partout dense dans 2. L’ensemble Q étant un 
ensemble ordinaire, au moins l’une des semi-trajectoires du mouvement f(r, ¢) 
sera partout dense dans cet ensemble; admettons pour fixer les idées que cela 
soit la semi-trajectoire positive. 

Assignons nous un nombre positif arbitraire « et montrons que dans tout 
domaine relatif Go C Q il se trouvera un domaine G,, tel que tout point du 
domaine G; ayant quitté le domaine Gp , passera au moins une fois A une distance 
inférieure 4 €, de tout point de l’ensemble © et retournera ensuite dans le domaine 
Gy , le temps variant dans un sens positif ou négatif. 

Recouvrons l’ensemble 2 par un nombre fini de domaines G', G?, --- , G” dont 
les diamétres ne surpassent pas «,. Considérons le systéme des domaines 
@, G?, --- , GY, GY+1 ot Gat! = G. 

Des propriétés du mouvement f(r, t), dont la trajectoire est partout dense 
dans Q, découlent la possibilité de choix du point initial r tel que r C G, et 
l’existence d’une suite finie de nombres positifs croissants tf, f, --- ,tw, tase 
telle que 


flr, t) CG (i = 1,2,---N,N +1). 


En vertu du théoréme sur la dépendance continue des trajectoires des condi- 
tions initiales, on peut décrire autour du point r un domaine G’ C G) assez petit 
pour que 

{G@,t) CE (¢ = 1,2,---,N,N + 1). 


Alors, le temps variant dans le sens positif, tout point du domaine G’ passera 
au moins une fois 4 une distance inférieure 4 «, de tout point de l’ensemble 2 
et retournera ensuite dans le domaine Go. 

Considérons le domaine G”’ = f(G’, twi1) C Go. Tout point de ce domaine 
jouit des mémes propriétés que les points du domaine G’, mais seulement pour 
le temps variant dans le sens négatif. 

Nous pouvons appliquer 4 présent au domaine G”’ les raisonnements que nous 
avons faits plus haut par rapport au domaine Go, et construire un domaine 
G, CG” CG, tout point duquel passera au moins une fois 4 une distance infé- 
rieure 4 ¢, de tout point de l’ensemble & et retournera ensuite dans Go. 

Prenant en considération que G; C G@” nous arrivons 4 Ja conclusion que tout 
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trajectoire positive du mouvement f(r, ¢) avec le e-voisinage que nous venons de 
construire. En vertu de nos suppositions tout point de cet ensemble sera un 
point limit w du mouvement f(r, ¢) et par conséquent l’ensemble lui-méme sera 
un ensemble parfait. Mais alors il ne peut pas étre un ensemble dénombralle. 
Nous arrivons 4 une contradiction et le théoréme est démontré. 

THEOREME V. Soit © un ensemble ordinaire dynamiquement indécomposable 
et soit f(r, t) une trajectoire partout dense dans cet ensemble. Désignons par 0, 
et 0, Vensemble des points limites a resp. w de la trajectoire f(r, t); alors 8 = 
6. + 9. 

Admettons que le théoréme ne soit pas vrai. Alors l’ensemble 6’ = 0 — 
(0, + 0.) n’est pas vide. On sait que les ensembles 0, et 9,,, et par conséquent 
leur somme sont des ensembles fermés. Cela étant, l’ensemble 0’ sera un 
ensemble ouvert relativement 4 6. Par conséquent il se trouvera un domaine 
G = 0’ qui ne renfermera pas des points des ensembles 9, et 9... 

L’ensemble © étant un ensemble ordinaire on peut indiquer un point p C G 
qui ne sera pas situé sur la trajectoire du mouvement f(r, t). Le point p n’appar- 
tiendra évidemment pas au points limites a ou w du mouvement f(r, t). Cela 
étant, on peut décrire autour du point p un voisinage G’ C G d’un diamétre 
assez petit pour qu’il ne contienne aucun point de la trajectoire du mouvement 
f(r, t). Mais ceci contredit la supposition que la trajectoire du mouvement 
f(r, t) est partout dense dans 6. Le théoréme est démontré. 

THtorrME VI. Soit 0 un ensemble ordinaire dynamiquement indécomposable 
et f(r, t) une trajectoire partout dense dans cet ensemble. Alors au moins l'une des 
semi-trajectoires de ce mouvement sera partout dense dans @. 

Admettons de nouveau que ce théoréme ne soit vrai, c’est 4 dire qu’aucune 
des semi-trajectoires f(r, ¢) ne soit partout dense dans 8. Désignons respective- 
ment par 0, et 9, les ensembles des points limites a et w du mouvement f(r, #). 

Les ensembles 0, et ©, sont des ensembles fermés invariants qui, selon nos 
hypothéses, seront les parties vraies de l’ensemble 6. De plus en vertu du 
théoréme précédent 6 = 0, + 0,. 

Les conclusions obtenues contredisent l’indécomposabilité de l’ensemble 0, 
et le théoréme est démontré. 
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Nous dirons qu’un ensemble invariant, compact et fermé est un ensemble 
quasi-minimal et nous le désignerons par le signe 2 si cet ensemble est dyna- 
miquement indécomposable ordinaire. 

TuHtorf&ME VII. Tout mouvement stable au sens de Poisson f(p, t) dont la 
trajectoire est compacte, appartient d un ensemble quasi-minimal Q, la trajectoire 
de ce mouvement étant partout dense dans Q. 

Désignons par © |’ensemble des points appartenant 4 la fermeture de la trajec- 
toire f(p, t). L’ensemble Q est invariant, compact et fermé; de plus la trajectoire 
du mouvement f(p, t) est partout dense dans 2. 

L’ensemble © est un ensemble quasi-minimal; en effet tout sous-ensemble 
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invariant fermé contenant f(p, ¢) coincide necessairement avec Q, done Q est 
indécomposable. 

Il nous reste 4 montrer que l’ensemble @ ne peut pas étre un ensemble excep- 
tionnel. En effet si cet ensemble était exceptionnel, alors en vertu du théoréme 
IV la trajectoire f(p, ¢) serait une trajectoire caractéristique et aucune de ses 
semi-trajectoires ne serait partout dense dans 2. Mais ceci contredit la défini- 
tion de la stabilité au sens de Poisson. Le théoréme est démontré. 

TutortmME VIII. Tout ensemble quasi-minimal Q contient un mouvement 
stable au sens de Poisson f(p, t), dont la trajectoire est partout dense dans cet en- 
semble. 

L’ensemble Q étant indécomposable, il en découle |’existence d’un mouvement 
f(r, }), dont la trajectoire est partout dense dans Q. L’ensemble Q étant un 
ensemble ordinaire, au moins l’une des semi-trajectoires du mouvement f(r, ¢) 
sera partout dense dans cet ensemble; admettons pour fixer les idées que cela 
soit la semi-trajectoire positive. 

Assignons nous un nombre positif arbitraire « et montrons que dans tout 
domaine relatif Go C Q il se trouvera un domaine G,, tel que tout point du 
domaine G; ayant quitté le domaine G , passera au moins une fois A une distance 
inférieure & e, de tout point de l’ensemble © et retournera ensuite dans le domaine 
Gy , le temps variant dans un sens positif ou négatif. 

Recouvrons l’ensemble 2 par un nombre fini de domaines G', G?, --- , G” dont 
les diamétres ne surpassent pas «. Considérons le systéme des domaines 
G', G?, --- , GY, GY ot Gt! = G. 

Des propriétés du mouvement f(r, ¢), dont la trajectoire est partout dense 
dans 2, découlent la possibilité de choix du point initial r tel que r C G, et 
l’existence d’une suite finie de nombres positifs croissants t:, 2, ---,tw, twas 
telle que 


f(r, t:) CG (i =1,2,---N,N +1). 


En vertu du théoréme sur la dépendance continue des trajectoires des condi- 
tions initiales, on peut décrire autour du point r un domaine G’ C G» assez petit 
pour que 


1G, t) C& @@ =1,2,---,N,N +1). 


Alors, le temps variant dans le sens positif, tout point du domaine G’ passera 
au moins une fois 4 une distance inférieure 4 « de tout point de l’ensemble 2 
et retournera ensuite dans le domaine Go. 

Considérons le domaine G” = f(G’, tvi1) C Go. Tout point de ce domaine 
jouit des mémes propriétés que les points du domaine G’, mais seulement pour 
le temps variant dans le sens négatif. 

Nous pouvons appliquer 4 présent au domaine G”’ les raisonnements que nous 
avons faits plus haut par rapport au domaine Gp, et construire un domaine 
G, CG” CG, tout point duquel passera au moins une fois 4 une distance infé- 
rieure & ¢ de tout point de l’ensemble Q et retournera ensuite dans Go. 

Prenant en considération que G; C G” nous arrivons 4 la conclusion que tout 
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point du domaine G,; C Gp» passera au moins une fois 4 une distance inférieure 
& « de tout point de l’ensemble © et retournera ensuite dans le domaine G) 
pour le temps variant dans les deux sens. 

Soit «, €,--+,€n., +++ une suite de nombres positifs convergeant vers zéro. 
Conformément aux nombres de cette suite construisons dans 2 une suite de 
domaines relatifs Go, G;, Gz, ---,Gn,--- telles que: 1) la fermeture de tout 
domaine soit contenue dans le domaine précédent, 2) les diamétres des domaines 
conyergent vers zéro et 3) tout point du domaine G;,.4; ayant quitté sa position 
initiale, passera 4 une distance ne surpassant pas €+1 de tout point de l’ensemble 
© et retournera ensuite dans le domaine G;, , le temps variant dans le sens négatif 
ou dans le sens positif. 

Le trajectoire f(p, t) déterminée par le point p appartenant a |’intersection 
de la suite des domaines que nous avons construits, appartiendera 4 chacun des 
faisceaux des trajectoires sortant de ces domaines. Cela étant, le point p, 
en décrivant sa trajectoire, traversera le voisinage arbitrairement petit de tout 
point de l’ensémble © et en particulier le voisinage arbitrairement petit de tout 
point de sa trajectoire, pour le temps variants dans le sens négatif ou dans le 
sens positif. 

Ceci veut dire que le mouvement f(p, ¢) dont la trajectoire est partout dense 
dans Q est stable au sens de Poisson. Le théoréme est démontré. 

Pour conclure citons un exemple simple d’ensembles quasi-minimaux. 

Introduisons sur la surface du tore des coordonnées orthogonales curvilignes: 
¢ l’azimuth du plan méridien et y l’angle polaire dans le plan méridien avec le 
k6éle dans le point d’intersection de ce plan avec |’axe du tore. 

Considérons sur la surface du tore l’ensemble de mouvements détérminés 
par les équations différentielles: 


dy dy 
1 —=-da —_ x= b 
(1) dt ; dt : 
ou a et b sont des constantes. 

Il est connu que a et b étant incommensurables les trajectoires ne sont pas 
fermées, et l’ensemble des points de chacune des trajectoires est partout dense 
sur la surface du tore. La surface du tore est un ensemble minimal, formé 
des points des trajectoires de l’ensemble des mouvements récurrents ayant la 
puissance du continu. 

Nous aurons un exemple d’un ensemble quasi-minimal qui ne sera pas minimal 
si dans l’exemple examiné du systéme dynamique les équations différentielles du 
mouvement seront remplacées par les équations suivantes: 

dy ee dy 2 eee 
2 —_—_— = Ps 2 —_ 2 —_ —_—_ = 2 —_ 2 —_ . 
(2) FT o( sin 5 + sin 9 a b{ sin 5 + sin 5 

Dans ce cas le caractére des mouvements changera de la maniére suivante: 
le mouvement du systéme (1) dont la trajectoire . 
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passait par l’origine des coordonnées, se décomposera dans le systéme (2) en 
trois mouvements différents: 1) le point de repos A l’origine des coordonnées et 
2) deux mouvements instables au sens de Poisson. Le caractére de tous les 
autres mouvements variera quant aux vitesses, cependant leurs trajectoires 
resteront les mémes, elles seront partout denses sur la surface du tore. Ces 
mouvement ne seront plus des mouvements récurrents, ce seront des mouve- 
ments stables au sens le Poisson. 

La surface du tore se trouvant étre dans ce cas un ensemble quasi-minimal ne 
sera pas un ensemble minimal car elle contient le point de repos comme sous- 
ensemble vrai, invariant et fermé. 


Moscou, U. R.S. 8S. 
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UBER DIVERGENZERSCHEINUNGEN DER LAGRANGESCHEN 
INTERPOLATIONSPOLYNOME STETIGER FUNKTIONEN 


Von Gtza GrinwaupD 
(Received January 22, 1936) 
Einleitung. 
Es seien 
(1) a", rs he a a'n) 
verschiedene im Intervall —1 < xz S +1 liegende Werte und 


(2) Te ae 


By weitere n reelle Werte. Dann liefert das Polynom 
; n(z) 

L,(z) - yy” , = ? 

(3) 2 * anla) (@ — a?) 


wr(z) = (x — x) (2 — af) --- (x — x) 





diejenige ganze rationale Funktion von héchstens (n — 1)-tem Grade, welche 
an den Stellen (1), der Reihe nach, die Werte (2) annimmt. Wenn die Zahlen 
(2) die von der Funktion f(x) an den Stellen (1) angenommenen Werte sind, so 
nennt man das Polynom (8) das Lagrangesche Interpolationspolynom der 
Funktion f(x); dasselbe, wie es aus dem Friiheren zu sehen ist, stimmt mit der 
Funktion an den Stellen (1) iiberein. 


Wenn wir die Zahlen (1) bei n = 1, 2, --- bilden, erhalten wir die Zahlen- 
menge 


a ) 


(2) 


a (2) 


» Te 


(3 3) 3 
a, 2, 2 


sree eee eeeeeeeee 


die wir eine Grundpunktgruppe der Lagrangeschen Interpolation nennen; die 
Zahlen (1) sind die Grundpunkte der Interpolation. 
Wir erhalten eine der interessantesten Grundpunktgruppen wenn wir als 
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Grundpunkte des Lagrangeschen Interpolationspolynoms n-ter Ordnung die 
Zahlen 


n Tv n 
‘ x\” = cos 5, a) = cos3 =, ---, 
n Tv n 
xy = cos (2k — 1) 5, ---, zn) = cos (2n — 1) = 


nimlich die Nullstellen des Tschebyscheffschen Polynoms n-ter Ordnung 


T(x) = cos (n are cos 2) 
nehmen. 
Vor kurzer Zeit habe ich gezeigt,! dass es fiir die Tschebyscheffschen Grund- 
punkte eine stetige Funktion f(x) gibt, fiir welche die Folge der Lagrangeschen 
Interpolationspolynome 


(5) Lilf(z)], Lal f(x), ot Ljlf(«)], ea 


im Intervall —1 S x S +1 fast iiberall divergiert, ja sogar fast iiberall un- 
beschrinkt ist (also die Menge der etwaigen Konvergenzpunkte von Masse 
0 ist). 

In dieser Arbeit werde ich diesen Satz folgendermassen verallgemeinern: 
Es seven die Grundpunkte der Lagrangeschen Interpolation die Zahlen 


a) = cos (2k — 1) (k = 1,2, --- ,n) 


dh. die Nullstellen des Tschebyscheffschen Polynoms n-ter Ordnung 
T,(x) = cos (n are cos 2) 


dann gibt es eine stetige Funktion f(x) deren Folge von Lagrangeschen Interpolations- 
polynomen in sémtlichen Punkten des Intervalls —1 S x S +1 divergiert, sogar 
unbeschrankt ist. 

Die Methode, mit welcher wir diesen Satz beweisen, zeigt viele Ahnlichkeit 
mit der in der oben zitierten Arbeit beniitzten. Wir wollen uns aber nicht auf 
diese Arbeit stiitzen und so skizzieren wir auch hier ganz kurz die Methode. 

Man nennt die Funktionen 

(n) = wn(2) — oe 

Li (x) = o (a(x a 2”) (k 1, 2, ,n) 
die Grundfuktionen der Interpolation. In der Untersuchung der Divergenzer- 
scheinungen der Folge (5) spielen die Funktionen (die Summe der absoluten 
Betriige der Grundfunktionen) 


(6) ated > 1 1%(a) | 


1 Giza Grinwatp, “Uber Divergenzerscheinungen der Lagrangeschen Interpolations- 
polynome,” Acta Litter. ac Scientiarum. 7. (1935), S. 207-221. 
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eine entscheidende Rolle. Namlich ein Satz von Haun besagt,” dass wenn die 
Funktionen (6) in einem Punkte x» eine unbeschrankte Zahlenfolge bilden, dann 
gibt es eine im Intervalle —1 < x S +1 stetige Funktion deren Lagrangesche 
Interpolationspolynome im Punkte x» unbeschrinkt sind. 

Nun gilt im Falle Tschebyscheffscher Grundpunkte die Abschaétzung 


(7) Aa(z) 2 =| Ta(2) | log n — ox(z) 


wo ¢,(z) und in Folgenden c2(x), --- nur von x abhangige positive Zahlen be- 
zeichnen. Aus dieser Ungleichung folgt, dass die Folge d(x), d2(x), ---, 
dn(x), --- in keinem Punkte des Intervalls —1 S x S$ +1 beschrankt sein kann, 
da sonst die Folge | T:(x) |, | Te(x)|, ---,| T(x) |, --- zu Null konvergieren 
wirde, was in Riicksicht auf 7',(z) = cos(n arc cos x) unméglich ist. Dieser 
Umstand erméglicht uns in allen Punkten x des Intervalls —1 S x S +1 eine 
stetige Funktion zu konstruieren, deren Lagrangesche Interpolationspolynome 
im Punkte x» unbeschrinkt werden. Zum Beweis unseres Satzes benGtigen wir 
anstatt (7) eine scharfere Abschatzung, die wir in §1 beweisen werden. Diese 
Ungleichung bezieht sich auf die Summe die aus (6) entsteht, indem man 
diejenigen Grundfunktionen ['”) (x) weglisst, die zu Abszissen x‘") gehoren, welche 
bei einer m diquidistanten Teilung des Intervalls —1 S x S +1 im selben Teil- 
intervall wie die Stelle x, oder in einem Teilintervall rechts davon liegen. Der 
Einfachheit halber behalten wir nur die Glieder mit ungeradem Index. Sie 
besitzen némlich in diesem Fall konstantes Vorzeichen. Die prizise Form der 
Ungleichung 


(8) An(x;m) = T(x) | log m — ¢(x) 


= | 
Qn 

zeigt, dass die Abschatzung in der Anzahl m der Teilintervalle gleichmissig gilt 
und dieser Umstand bewirkt es, dass wir einen Divergenzsatz fiir saimtliche 
Stellen des Intervalls —1 < x S +lerhalten. Wir gelangen zu unserem voll- 
stiindigen Satz nach zwei Schritten. Zuerst in §3. beweisen wir das Vorhanden- 
sein einer stetigen Funktion ¢(x), deren Lagrangesche Interpolationspolynome 
in simtlichen Punkten des Intervalls -—1 < x S +1 divergieren (sogar unbe- 
schrinkt werden). Der zweite wesentlich einfachere Schritt besteht darin, 
dass wir in §4. die Existenz einer stetigen Funktion ¥(z) zeigen, deren Lagran- 
gesche Interpolationspolynome in einem vorausgegebenen Punkte des Intervalls 
—1 S x S +1 unbeschrinkt werden, aber in den anderen Punkten beschrinkt 
bleiben. Es werden die beiden Funktionen ¢(x) und y¥(zx) effektiv gegeben. 
Die gesuchte stetige Funktion f(x) wird durch die Summe von ¢(zx) und ¥(z) 
gebildet. In den bisher nicht erwahnten §2. beweisen wir zwei auf Tscheby- 
scheffsche Grundpunkte beziigliche Hilfssitze. 





? Hans Haun, ,,Uber das Interpolationsproblem,” Math Zeitschrift, 1 (1918), S. 115-142. 
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1. Untere Abschatzung geeigneter, aus den absoluten Betrigen der Grund- 
funktionen gebildeter Summen. 





Oe ge 


' Von nun an verstehen wir unter 2\'(k = 1, 2, --- ,n) die Nullstellen des 






]  Tschebyscheffschen Polynoms n-ter Ordnung, dann lassen wir ebenso wie bei f 

] — I?(z) die oberen Indizes fort. Auch schreiben wir x, in der Form bs 

2k — 1 | 
Xe = cos vd, vy. = — (k = 1,2, ---,n). 





~ Qn 






Es ist offenbar 





1>um>a>--->2,> —1. 










Es sei —1 < x S +1; zerlegen wir das Intervall —1 < x < +1 in m gleiche 
Teile; rechnen wir jeden Teil den rechten Endpunkt hinzu, den linken aber 
nicht. Es sei x im (u + 1)-ten Teilintervall enthalten (von links gerechnet) 
wu = 0,1, 2,---,m— 1. Betrachten wir diejenigen ungeraden Werte von k, f 
fiir welche x; im ersten, zweiten, --- , oder u-ten Teilintervall liegt. Nehmen 
wir an, dass es solche Werte von k gibt; dazu geniigt es, dass x, und z,_; zum att 


ersten Teilintervall gehéren, x aber nicht, also dass t 






Witenes e 






ee ee ae 


l+maSe<it« thE 
< 





wegen 1 + 2p_1 = COS Jn_1 — COS T S 7 — Dp < 6/n (wenn namlich r 2 a> ci 
8 = 0, dann ist cos B — cos a < a — B wie zum Beispiel aus der geometrischen i | 
Bedeutung des kosinus hervorgeht) geniigt es also, wenn m die Ungleichung ' 


_ ee 
l+2 





ee nc en 







<ms 





wls 





erfiillt, was wir voraussetzen werden. E 
Es sei also 2v + 1 der kleinste ungerade Index, fiir welchen 72,4; in einem der 
ersten u Intervalle liegt, und betrachten wir die Summe’® 


| bw 
| Ae(z;m) = >) | leya(z) |. 


k=v 















Im Falle Tschebyscheffscher Polynome 





} sin 0; T. 
) h(a) = WP(@) = pO — = (ys OT 










sin Vok41 T(x) 
n LX — Lois of ’ 








lox+1(x) = 













3 [a] bedeutet die grésste ganze Zahl in a. 
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besitzt fiir die betrachteten Werte von k dasselbe Vorzeichen wie T',(x) woraus 


folgt, dass 
et. . | 
An(z;m) = b> mn Soogs dl 2; loxsa(a) | . 


— © — COS x41 iow 











Da aber sin 3/(x — cos #) eine monoton abnehmende Funktion von @ ist, so ist 
n—1 


2 
‘ — | T(x) | sin Dor41 
An(; m) sal — pe ESSER Sam 


X — COS Voxs1 
k=y 





1 <4 sin 3 
~~ 18 POs} MY 
aq | Pala) | sa 


IV 


Pov 


2 


nun ist aber v > 0 (da x, wegen 1 — x < 3; = 1/2n < 6/n S 2/m gewiss im 
letzten Teilintervall liegt) also 


. 2 
0 <t=- T41 = (x one X2y-1) + (2-1 beet Xey41) < m + Voy41 emi Joy—-1 


sic 
IIA 
} on 


wee 
m 


3|% 


m 


lA 


daher gewinnt man wegen | 7’,(x) | S< 1 


An(zj; m) = 25 sane | T (2) | (108 —< e(a)) > x | T(x) | log m — eg(zx). 








Wahlt man ¢;(z) = — ~ (ca(2) + log 5) so, dass ¢3(x) = = 5, 108 i ra ausfalle, so 
gilt diese Ungleichung auch fiir m S r3 falls dann unter \,(2; m) Null zu 
verstehen ist; also gilt fir -—1 S24 5+1,ms S35 

(9) Aa(a3 m) 2 J | Pa(z) | log m — o4(c). 


2. Hilfssatze iiber Tschebyscheffsche Grundpunkte. 


Hiurssatz 1. Es seien n und n’ teilerfremde ungerade Zahlen. Ist £& eine 
T schebyscheffsche Abszisse zugleich von n-ter und n’-ter Ordnung d.h. eine Nullstelle 
von T,(x) und T(x), soist— = 0. Ist & eine Tschebyscheffsche Abszisse zugleich 
von 2n-ter und 2n'-ter Ordnung, so ist entweder § = 1/+/2, oder — = —1/+/2. 
Endlich die Tschebyscheffschen Polynome n-ter und 2n’-ter, ferner n'-ter und 2n-ter 
Ordnung kénnen keine gemeinsame Nullstellen haben. 

Im ersten Falle ist 

2k —1. 2k’ — 1 


€ = cos on T= of (isksn,1sk' Sn’) 





, 1 
| T(x) | (toe yaa. “ch e(e)) ; 
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also ist 


2k—1 2k’ —-1 
Qn tiéi’ 





ein echter Bruch; auf reduzierte Form gebracht ist sein Nenner ein gemeinsamer 
Teiler von 2n und 2n’ daher ist der Bruch gleich 3, also 


— = cos 3x = 0. 
Im zweiten Falle ist 


2k —1 2k’ —1 
= = COs < < k’ F 
— = cos —* cos —e (lS ks 2n,1 Sk’ S 2n’) 


2k —1 7 2k’ — 1 
4n 4n’ 





wieder ein echter Bruch mit einem Nenner, der gemeinsamer Teiler von 4n und 
4n’ ist und offenbar nicht 2 sein kann; daher ist der Nenner 4, der Bruch selbst 


+ oder 2, also 
£ = cos dr = 1/+/2 oder & = cos 34 = —1/V/2. 


Endlich wenn 
2k -—1 2k’ —1 


cos wv = cos ——_f 
4n 2n’ 


x = cos ——— 
4n’ 


ware, so miisste 


Me—-1_ 2’-1, mb—-1_ 2-1 
4n Dn’ "Qn 4n’ 








sein, was offenbar unmdglich ist. 
Hitrssatz 2. Es sei 2v + 1 der kleinste ungerade Index, fiir welchen 


2//n > 1 + Zany. 


Dann ist 


hed 
b> 24.4(—1)| > clogn 


k=y 


woc > 0 eine absolute Konstante ist. 
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Es ist in der Tat 


[Fr] [FJ 








1 Sin Jex41 1 /*() sind 
p> (nm) (4) ae * els LE aoe : dd 
Ls Ue" 4.16 1) n p> 1 + CoS Sx41 27 — 1 + cos? 
1+ cos dy. 1/Vn 
= = 1 ; 
108 + cos? ait a/n ae 





o[*>*] 


3. Konstruktion einer stetigen Funktion, deren Interpolationspolynome im 


_Intervall -1 <2 S +1 divergieren. 


Es sei o, das Maximum der stetigen Funktion \,(x) fir —1 < ¢ S$ +1 und 
p(n) der grésste under den Werten oi, o2,---,on. Dann ist offenbar p(1) < 
p(2) < --- S p(n); ferner ist wegen d,(z\") = 1, p(n) 2 on 21. Fiir eine 
beliebige Funktion f(z), wenn —1 S f(z) S +l und» S nist, gilt fiir simtliche 
Punkte des Intervalls —1 S$ x < +1 die Ungleichung 


| LAf(z)]| s »» UY (a) | = r(x) S o, S p(n). 
=1 
Es sei 1, m2, --- eine unendliche Folge natiirlicher Zahlen mit (ni,n,) = 1 
(i,r = 1,2,---). Von nun an wird jedes n aus dieser Folge gewahlt. Es sei 


ni, eine beliebige Zahl der Folge (7; > 1). Teilen wir nun das Intervall —1 < z 
< +1in7z gleiche Teile. Der rechte Endpunkt wird jedesmal dem Teilintervall 
zugerechnet, der linke aber nicht. Nach Hilfssatz 1. k6nnen die gemeinsamen 
Nullstellen von je zwei der Tschebyscheffschen Polynome n;,, 27;,, 41, 2,41, 

++ y Mingo, 2QMige, +++ 5 Noe,, 2nai,-ter Ordnung nur 0, + 1/+/2sein. Wir kénnen 
also ein Polynom ¢;(x) vom Grade n;, bestimmen,* das in den Nullstellen von 
ungeradem Index der Tschebyscheffschen Polynome ni... und 2n;,,.-ter Ord- 
nung (ausgenommen die etwaigen Nullstellen 0 und + 1/+/2) wenn diese vom 
Endpunkte des s-ten Teilintervalls links liegen (s = 1, 2, --- , 71) den Wert +1 
annimmt; in den tibrigen Nullstellen der Tschebyscheffschen Polynome 1i,++ 
und 2n;,,.-ter Ordnung, eingenommen die Nullstellen 0 und + 1/+/2, es sei 
¢gi(z) = 0 und im ganzen Intervall —1 S$ x S +1 . 


| g(x) | S 2; 





‘ Es gilt naimlich der, mit dem Weierstrassschen Satz leicht beweisbare Satz: Es sei f(x) 
eine stetige Funktion im Intervall —1 S x S +1 und x, 22, -++, Lnj Yr, Y2, °° * y Yn VOrgegebene 
Zahlen die im Intervall —1 S x S +1 liegen, so gibt es zu jeden ¢ > 0 ein Polynom P(z), dass 
im Intervall -1 SxS +1 

| P(x) — f(z) |<e 
und : 


P(x) = ys 


wih uence TI 
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endlich geniigt 272 die folgenden Bedingungen 





21, < te, log t2 > 2%p(2ne;,) -2 






~~ es 


“oe 





Teilen wir nun das Intervall —1 S$ z S +1in iz gleiche Teile. Nach Hilfssatz 1. 
kénnen die gemeinsamen Nullstellen von je zwei der Tschebyscheffschen Poly- 
nome der Ordnung 7j,, 2n;,, Nissi, 2MNiz1, +++ y Nizts, Wigs, --- , Naiz, 2M, NUT 
+ 1/+/2 und Osein. Also kann man wieder ein Polynom g(x) vom Grade nj, 
bestimmen das in den Nullstellen von ungeradem Index der Tschebyscheffschen 
Polynome 7;,4s, 27,4.-ter Ordnung (ausgenommen die etwaigen Nullstellen 0 
und + 1/+/2) wenn diese vom Endpunkte des s-ten Teilintervalls links liegen 
(s = 1, 2,---,%), den Wert +1 annimmt; in den iibrigen Nullstellen der i 
Tschebyscheffschen Polynome n,,4,, 27;,4.-ter Ordnung und in den Nullstellen 

Ound + 1/+/2, gleich Null ist und im Intervall —1 < x < +1] g(r) | < 2; 

endlich erfiille 73 die folgenden Bedingungen: 


2te < 13, log 13 > 2%p(2ngi,) -3 i 













Ebenso definieren wir ¢3(x), ga(x), --- , ¢-(x), --- . Allgemein gilt: | 
(10) 2i,1 < t,, log i, > 27,7, | ¢-(x) | S 2 










wo 
(11) 


Bilden wir nun die Funktion 








— p(2nei,_,), n=l. Y 





¢r(z) 
2’ tT, i 





2 sibs Rea IT TIS 





g(x) = 





r=1 


Es ist klar, dass g(x) im ganzen Intervall -—1 < x S$ +1 stetig ist, und 
|y(x) | < 2. Wir beweisen, dass die zu den Tschebyscheffschen Abszissen mn || 
gehérigen Lagrangeschen Interpolationspolynome der stetigen Funktion ¢(z) in 
simtlichen Punkten des Intervalls —1 < x < +1 divergieren, ja sogar 
unbeschrinkt werden. 

Es sei x ein beliebiger aber fixierter Punkt des Intervalls -1 < 2 = +1. 
Zerlegen wir das Intervall —1 < x < +1 in 7, gleiche Teile. Es liege x im 
(s + 1)-ten Teilintervall. Bilden wir die Interpolationspolynome 1;,;, und 
2n;,,.-ter Ordnung der Funktion ¢(z) 
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_ - g(x) | 
Lani +s [y(x)] sad 2 Lan +s ES 
(13) < g(t) ] g(x) . ge(x) 
= 2 Leni +s ES + Lani +s FS + bo Lens +2 [e@] 
k=1 k=r+1 
=>i+>dD.+2ds. 
Nach Konstruktion sind aber fiir k = 1, 2, --- ,r — 1 die Polynome ¢;(z) von 
héchstens n;,-tem Grade; daher ist 
Lni,+sl9e(2)] = g(x), (k = 1, 2, mrss 1) 
Lon;,4el¢x(2)] - g(x), (k = 1, 2, a9 tng 1) 
und also 
~~ (2) S ex(z) 
a PAL ia PRL 
(14) os|2d,l= 2) Dini +s 32] 2 2 tr, 
r-l1 ( ) r—l ( ) 
ae ; PRL - PRT) | 
(15) os|XI| 2 Tess Ea |S) =| s2 











Nach der Definition von p(n), 


| Us| = 


-] 


gu) p(x) 
2 Ponies ES Diets E | 


p3 











(16) sii big 
i 1 
<2 GA Mints) <2 2 att 
2, 2* p(2naix) 2, 2 
und ebenso 
(17) Pai <t 


Nun untersuchen wir noch 22 und 33. 

Das Interpolationspolynom n;,,.-ter Ordnung der Funktion ¢,(x) stimmt, 
abgesehen von héchstens drei Gliedern (denen die etwaigen Nullstellen 0 oder 
+ 1/+/2 entsprechen) mit der im §1. abgeschatzten Summe Ani (2} tr) (m = tr, 
n = n;,) iberein. Nun ist nach einem Satz von Frsé&r' im Falle Tschebyscheff- 
scher Abszissen fiir jedes k und n 


| (a) | < V2 





’Leopotp Frs&ér, ,,Lagrangesche Interpolation und die zugehérigen konjugierten 
Punkte,’’ Math. Annalen, 106 (1932), S. 1-55, insb. Formel (28). 
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woraus folgt, dass 









IV 


us 





 & 
| bi | or, lr | cos (ni,4s are cos x) | log 7, — (x) — 34/2 





Ebenso 





: . & 
| po > = | cos (2n;,4. are cos x) | log i, — o(x) — 34/2 





also wegen (10) 





| po | > 5 | cos (m4. are cos r) | — (x) — 34/2 






r 
= a | cos (Ni,+2 are cos x) | ~ c,(x), 






IV 


’ 
| 2 | z | cos (273,42 arc Cos 2) | — (zx). r 
2 2r 






Endlich alle diese Ungleichungen zusammenfassend, ist 
| Lnipsal(z)]| = [21+ 22+ 2al 2 [Ze] — [21| — [2a | Raat 


(18) r , 
= | cos (ni,4s are cos r)| — a(x) — 2—1, | 
T 






IV 













) | Lonirelo(z)]| = |i +22 + 23] = | 22 — [24] — [25 | 
(19) r 
} = — | cos (2nj,,, are cos z)| — a(x) — 2-1. pn 
: ” i 


Nehmen wir nun an, dass die Lagrangeschen Interpolationspolynome der + || 
Funktion g(x) im Punkte xz beschrinkt blieben. In diesem Falle miisste wegen n 
a 
(18) und (19) | 
lim cos (n;,4s are cos x) = 0 


ro 











lim cos (2n;,4. arc cos x) = lim [2 cos? (nj,4. are cos z) — 1] = 0 


ro ro 





was ersichtlich unméglich ist. 







e 4. Konstruktion einer stetigen Funktion, deren Interpolationspolynome in 
4 einem einzigen Punkt unbeschrankt werden. 





Wir beweisen, dass es eine stetige Funktion y(z) gibt, deren zu den Tscheby- 
scheffschen Abszissen gehérigen Lagrangeschen Interpolationspolynome in 
simtlichen Punkten des Intervalls —1 < x < +1 beschrankt sind, aber im | 

|  Punkte —1 unbeschrinkt werden. By 
Bedeute p(n) dasselbe wie im vorigen § und es sei m < m2 < --- eine Folge 
der natiirlichen Zahlen mit (n,;, n,) = 1(i, r = 1, 2, --- ). 
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Wir definieren die Funktion y,,(z) folgendermassen. Sie sei gleich +1 in 

allen Nullstellen von ungeradem Index des Tschebyscheffschen Polynoms n,ter 

Ordnung auf welchem 


1+ ay < 2/0; 


gilt; in den iibrigen Nullstellen des Tschebyscheffschen Polynoms n,-ter Ordnung 
sei sie gleich 0. 

Es sei endlich y,,(z) = 0 im Intervall —1 + 2/+/n; S x S +1 und y,,(z) 1 
etwa differenzierbar. Dann mit der Hilfe Hilfssatz 2 (mit der Methode des 








vorigen §) ist leicht beweisbar, dass die Interpolationspolynome der stetigen ™ 
F Funktion (1A 
i _ r Wnj(2) 
Me v(z) = 54 ? p(n;-1) for 
wees i= 7 
a im Punkte —1 unbeschriinkt werden, aber in den anderen Punkten des Inter- L*( 
iy a valls —1 < x S +1 beschrinkt bleiben, wenn wir nur die Folge n; geeignet 
pa wihlen. 
AK Endlich erwaihnen wir noch, dass unser Verfahren auch im Falle anderer the 
Fs ee bekannter Grundpunkte erfolgreich ist; darauf kehren wir gelegentlich ein ‘ 
andersmal zuriick. (1. 
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ON NON-HARMONIC FOURIER SERIES 
By Norman LeEvinson* 


(Received March 5, 1936) 


1. Wiener and Paley! have generalized many properties of the Fourier func- 
tions {e"*} to sets {e*=*} where 
(1.0) In| <SD<4, -#<n<e, 
for some fixed D. 

They have proved that if (1.0) is satisfied then the set {e»*} is closed in 
[?}(—2, r) and possesses a unique biorthogonal set {h,(x)} such that for 


f(x) eL*(—1, x) 


the series 


-— . —ing — pidnz 

(1.1) > {c [ SG) emt ag — em PHB) halt axh 

converges uniformly to zero over any interval (—x + 6 S x S wm — 5) for any 
positive 6, and over any such interval the summability properties of 


(12) > em | $0) hale) a 


—oo 


are uniformly the same as those of the Fourier series of f(x). 

Wiener and Paley point out that whether or not 1/z* in (1.0) is the best possible 
constant is not evident. Malin? has in fact shown that the theorem holds with 
10/9x? in place of 1/7. 

Wiener and Paley also ask whether or not the theorem can be generalized to 
the case where f(x)eZ rather than L?. 

It is the purpose of this paper to show that the theorem remains true if 1/7’ 
is replaced by 3. Moreover it will be shown that this is the best possible 
constant. It will also be shown that there exists no general L theory but that 
there does exist an L? theory where 1 < p S$ 2. These results are contained in 
the following theorems: 





* National Research Fellow. 
1 Wiener and Paley, ‘‘Fourier Transforms in the Complex Domain.’’ Amer. Math. Soc. 
Coll. Pub. 19 (1934), Theorem XXXVIII. 
*H. Malin. Doctor’s dissertation at Mass. Inst. of Tech., 1934. 
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920 NORMAN LEVINSON 
TueoreM I. If {X,,} be a sequence of numbers and D a constant such that 


(1.3) ~<a ee< 85, —-xax<cn< ow, 
2p 
for some fixed p, 1 < p S 2, then the set {e'*"*} is closed L?(—-, mr) and possesses a 
unique biorthogonal set {h,(x)} such that for any f(x)eL?(—-7, m) the series (1.1) 
converges uniformly to zero over any interval (—r + 6 S x S mw — 4) for any sé > 0. 
Moreover the difference of the weighted sums (Riesz, Abel, etc.) of the non-harmonic 
and ordinary Fourier series also converges uniformly to zero over (—r +5 S28 
mw — 4). 
In other words the convergence and summability properties of the series (1.2) 
are exactly the same as for the ordinary Fourier series for f(z) over —4 < x < z. 
THeEorEM II. Jf (1.3) is replaced by 
p-l1 
-. Py Sit 
(1.4) lA. —n|s Op 
then the results of Theorem I no longer hold. 
Combining these two theorems, it is clear that there exists no general L 
theory of non-harmonic Fourier series since as p — 1, D— 0. 
In proving Theorem I the following result is of basic importance. 
THEOREM III. Let {d,} satisfy (1.3) and let 


(1.5) F(z) = II (1 - a1¢ - 2). 


Then there exists an absolute constant A such that 








L°) . A 
(1.6) [ _|F(@) Par < ——* ao, 
and 
” Ac” (p—1-2pbD) 
Pp 
(1.7) | F(x) |?dx < (p — 1 — Dpbyt 
fore > 1. 


As we shall see, to prove that Theorem III is a best possible one is quite simple. 


2. First we shall prove Theorem III. It is convenient to use 0 < Ax, k = 0,1, 
--+ , for absolute constants. We now prove 
Lema 1. If F(x) is defined as in Theorem III then 


2N 2N P(x) |2pD 
2.0 F(x) |?dx < AyN-?H9? | 
( ) [ | (x) | x < AyN [ \2N + 1 — 2|?? 





where 
2N 


(2.1) po) = [[ (#t*52-2—) 


n—-D—-—x-2 





and a, are constants such that0 < a, S 1. 
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Let N > 0 be some integer. If N < zx < 2QN it is clear from (1.3) that 





1+ 


> ie siete 
n—D~ . a 





x x 
| ; 0O<n< ao, 








0<n<N, 













































7 1-5 2(1-3|, IN<n<o, | 
Therefore for N < x < 2N | 
) 2N 
|F@)|s I] 1-55 /h+-25| 
; ‘ , jet iz | | 











And since for n > 0,n — D S Xn, it follows from this that 


y 
; A 3 
ith. * 
2N vt 
} ; 


































x x 
re) s |] 1-75 1+ 5 
(2.3) C) 2N " * eet 
x zx ca A 
| 1-2 + Ts || 
2N+1 N 





Let us recall the following well-known properties of the gamma function: 






T(x + 1) = aT (z) 










T 


rz) — 2) = sin rx 












(2.4) T(x + 1) ~ (2r)tatte-=, 
Using these on the right side of (2.3) we have for N < x < 2N 


r (F()| < P= D) Pe + D)|sin w(@ + D)|TQN +1+ D)PQN +1—D~2) 
; a Ti —D+1TQN +1—D)lQN+1+D-—-2) 


2N 
xT] 
N 
















A, —- 2x 
s—-D-s\| 





Using (2.4) we have 













An — 2 
- p= Zi 





| sin x(a + D) | il 
Qn +1—2)” I. 





(2.5) | F(x) | < A,N-+? 
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When |n — D — x| < 3 we have 


oA. tnd 
n—D-—-z2z 


When 3 S | n — D — | the inequality 


(2.6) 





[|sin xe + D)| Sx 





(2.7) l1+ise, —-o <t< ow, 
gives 
An — 2 An —-n+D An —-n+D 
Feri. SNe IE nena: ae he: - A A, ee sae 
Lo pos + Seas A eos 
(2.8) 








An -2+D 4D 
* exp (pnt? 4+ —O )| 


Combining (2.6) and (2.8) we have 























2N rn — 

I] “a poe | sin r(x + D) | 

N 
An —~-n2+D 

(2.9) srleo( 25 a +82 >) gop atpt»)\ 
2N 
An -~n+D 
< asiemp - n—-D—2z—-2%\ 








If we set an = (An — » + D)/2D then clearly 0 S a, S 1 and (2.9) becomes 














2N 5 2s a 
(2.10) IT An 2 _| [sin x(x + D)| < Avje™ 7 ** 
For |z| < 3 we have 
(2.11) le |S |1+2| 422). 


Using this in (2.10) we get 

















2N en 
An — 2 . a +t+n—-D—2x— 2%)? 
TT je paal sine +1 < 4s J] 5 Pr 
If we use this in (2.5) we have (2.0). 
Lemma 2. Let 
(2.12) fla) = (2 = % + tb) --- (= am + 0b) 





(x — a + 1b) «++ (x — dexy1 + 7b) 





cranes NOR 2 8 


OWN PAM SoD aceon 


FIN Nee Casbah ited e 
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where 0 S a < 2 < +++ < Ax Slandb=O0. IfO0 St < 1 we hare 


(2.13) ‘ | f(z) |'dx s — 






Let 








om (z — de) --+ (2 — ay) 
H(z) ead er +++ (Z — Aox43) 


Using the partial-fraction expansion for H(z) we have 


H) = i( C1 + Cs fos + mt) 
‘4 


== Be z2— a3 2 — Aex+1 












where clearly c; > 0,c3 > 0, --- , Coxy1 > 0. If we now set z = x + iy we have 










H(z) = ay ee ny ' 
nee (c-—ayPr+y . 7 (2 — Guy)? + y ls 


Thus RH(z) = 0 for y = 0, and therefore i} 
(2.14) | am H(z) | S 3m, y 
Using (2.14) and 













IV 
o 






RHA*(z) = | H(z) |* cos (t am H(z)) 
we get 4 


RHA*(z) RH*(z) L 
cos#rt  1—t' in 







(2.15) | H(z) |' s 









nc ornnaaaAnaat eee RAT RMI SH SN 






Since H(z) has no zeros or poles in y > 0, H‘(z) is analytic for y > 0 and its 
integral around any closed contour in that region is zero. Thus for any fixed A 
quantity c,1 >c> 0, . F 


2 1 
il dx H*(x + ic) + if dy H'(2 + iy) 


1 






a ae ead 





2 1 
-| az He +i) -6 [ dy H((— 1+ ty) = 0. 
—§ c 











4 ’ The constant 5 is of no particular significance in this inequality. By using a more 
refined contour in the proof of this lemma we can show 


y ° , , ie > Ee SO. Fee 
, ©) i se pas s = —5| (75) +( t YI 


As t—> 1 the right side of (*) approaches 2/(1 — ¢). Since 


1 dz 2 i i 
{ » le-tr 1-t ' ee 


the inequality (*) is asymptotically perfect as ¢ > 1. 
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Or 





if dx H(z +%c)| < [ dy | H(2 + ty) | 
(2.16) * y 


1 2 
+ [ ayia-itin i+ [deine +0 
0 = 


It is clear that for any fixed z 


1 


Min | 2 — da41| 
OSISk 





(2.17) | H(z)| < 
Using this in (2.16) we have 


2 
[ dx H'(a2 + ic)} $1+1+3 =5. 
1 





Thus 


2 
[ dx RH(x + ic) S 5. 


1 


And using (2.15) we have 


2 
(2.18) [ dx | H(x + ic) | < —_ 
foranyc,1>c>0. Ife = 1, (2.18) is trivial by (2.17). Ifc = 0, (2.18) holds 
since it holds for alle > 0. Thus (2.18) is true for all c = 0 and (2.13) follows 
immediately. 
Proor or THreoreM III. If P(x) is defined as in Lemma 1 and if we set 
y = (c — N+ 1)/(N + 2) we have 


i | P(x) |222 
N—1 |2N +1 — z |??? 








4 neon [* [yo to | y — Gan + tb [PP 
s 0 | y — ay + 4b |??? +++ | y — Oaeya + 2b [PPP 


where 0 S a; < a2 --- < Gexi1 S 1 and where b = 2/(N + 2). If we now use 
(2.0) and Lemma 2 we have 








1 — 2pD 
If we set N = 2”, 2™+1, ..- | and add we have 
be A Q-—m(p—1—2pD) 
F(x) |p dx < : ; 
[ii FG) bar s (p — 1 — 2pD) (1 — BpD) 


The theorem now follows at once. 





ON NON-HARMONIC FOURIER SERIES 925 


3. In proving Theorems I and II we require the following well-known theorem 
on Fourier transforms: 
THEeorEM A. Let f(x)eL?(— ©, ~) for some fixed p,1 < p S$ 2. Then 


(3.0) g(u) = (2m)-tLicm. | - f(z) em de, 


Ao 
(where 1.i.m. ts here of p/(p — 1) order), exists almost everywhere and 


(3.1) f(x) = (2r)- lim 2 ~ Ls) g(u) e-i“= du 


A-2o J—A A 


almost everywhere. Moreover if q = p/(p — 1) then 
(3.2) | (ony [ ” | gu) |e au |" < | (ey [ ” | Ha) ras |” 


20 


It is clear from (3.1) that 


A 
(3.3) f(x) = (2r)- lim g(u) e~** du 
Ava J—A 
wherever this limit exists except for a set of measure zero. 

As regards the closure of {e*»*}, it follows from results proved elsewhere‘ that 
the conditions of Theorem I imply the closure L?(—7, x) of {e®»*}. The closure 
of this set implies at once the uniqueness of {h,(z)} if it exists. 

We now turn to the proofs of several lemmas concerning 


(3.4) G(w) = (w — ») I (1 - £) (1 -,”) 


where 


sa < p—1 
(3.5) lA, —-n|SD< > 


We continue to use the A; as absolute positive constants. In Lemma 3 we 
will not use all of (3.5) but merely |\, — | S 3. 
Lemma 3. If w = u + iv then 


(3.6) | G(w) | < Aw (jw| + I et!*!, 
(3.7) | G(w) | = Ars |o| (A + | w)aer!*!, 
and 

(3.8) |G + i) | > Aw 





¢ Levinson, “On the Closure of {e****},’”” Theorem II. To appear in the Duke Journal of 
Mathematics. 
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a Ifu=0,|w|= 2, and we choose an integer N determined by N — 5 |wi|< 
{ N + 3, then 
|Z] <: for n> N. 






Also, since u > 0, w/X, lies in the right half plane. Therefore 





jt-g[efr--2 | n>N. 





Similarly 









2] >i for0O<n<N 





and therefore 






Obviously 
















w w 
jr- 2 efit 2], n>0O 
and 
ere > _l| 
Aw ~N+2 
Therefore 




















> ai ue on: st ~ 
w “ w an 
«el - ally] 








Using the gamma function in a manner similar to that in Lemma 1 we have 


sin r(w — 2) 
eecerstdee oe 


|G(0) | = Bj» — a) ) 9) a 








(3.9) x [Fe DTW +4 — wr +9 





> Ay 1! |e] ert rw — DTN +$~ wr +9 
1 +/w Ny] Mw + IN +2—w TW +4) 
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But® for x 2 0 
(3.10) log | (a + ty) | = (a — 4) log| x + cy| — y tan” 2 —2z+0(1) 


uniformly as |z + ty|— ©. If we note that fora = Oandz = 0, 


t — ytan” 
y tan” — ytan 





= lvtan= < y tan ¢| 


y | ay 
t+a e(r+a)+y 
we have, using (3.10) in (3.9), 

| v | eFl*| 
) > ee A SRE ) > 
|Gw) | 2 Aw oe lwi >%,u20. 

Since G(—w) is of the same form as G(w), the results hold for all values of u 
and for|w|> 43. By precisely the same argument with | w | < 3, and therefore 
with N now zero, it is obvious that the result is true for all w. Thus (3.7) is 
true. The other results of the lemma are carried through in a similar manner. 
(3.8) is of course a consequence of (3.7) when | v | is not small. 

Lemma 4. The functions {gn(x)} defined by 
* G(u) 


(3.11) gr(x) = Lim. on bn GD) e-iuz dy 





form a biorthogonal sequence to {e"*}, and g,(x) ¢€L*, gq = p/(p — 1). 
By Theorem III 


G(u) 


Pp 
ke: 


Thus by Theorem A, g,(x) defined as in (3.11) exists. 
For any fixed x > z it follows at once from (3.6) that 


a G(u)e-i“= i ta tte G(w)e-*? 


clits eee wee 


¥r(x) = 








where C is the closed contour formed by the line —A S u S A and the semi- 
circle | w| = A, —7 S amw S 0. 

Since the integral of an analytic function around a closed contour is zero, it 
follows that ¥,(z) vanishes forz > 7. But 


ga(2) = 16" OnF*(4 vate) — val2)) 





5 This is the real part of the complex form of Stirling’s formula 
log T(z) = (z — 4) log z — z + 3 log 2x + O(1) 


as | z|— © uniformly for —-7 +65 amzS2—6,6>0. 


Se es See So Se 





ee eee 
en ew ee 


——s 




















ig i 
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and therefore g,(z) = 0 for x > z, and similarly forz < —z. Using Theorem A 


again we have, therefore, 
G(u) , , 
= ™ tuz dr. 
re he 


From this it follows at once that 


° ' 0,mAn 
/ gr(xem dx = { 


l,m=n., 





Proor or THEorEM I. Let C denote a rectangular path in the complex ¢ 
plane with vertices at (V + 4 +%7M, -—-N-—}3+iM,—-N—34-—-iM,N + 
4 — iM). Let ¢(x) = 1for|z| < land ¢(z) = Ofor|z|> 1. Let G(u) be 
defined as in (3.4). 

Using residues and recalling Lemma 4, we have 











1 sii eifz 
Lim. rr “a G(u)e vf ag au 
1 - ei dnt eit® u 
_ “See Af — |> Foge= a) ~ ae Mare »| 
‘ : ‘tne _ Sin (N + 9)(@ — y) 
> gn(y)e ral 
If f(x) « L? then 
ye es ie guly)f(y)dy — - | sy pees te 7 Yay 
(3.13) 
1 4 io ez 
a Si “dy fly) L.i-m. 2 du G(u)e-*v vf at coe 


We want to show that the ais side of (3.13) tends to zero as N tends to 
infinity. Let 


N+ eitz—-Mz 


(3.14) Ii(z) = 











iy dy fly) Lim. [au ctayerse [a ‘Ge@+iM)u—'—iM)| 


This is in effect that part of the right side of (3.13) for which ¢ varies over the 
upper horizontal side of the rectangle C. By Theorem III 





(3.15) Gu) 
u+t 

and therefore, using Hélder’s inequality. 

(3.16) a 


(u +i? ° 
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1 a 1 4 g 
u—&—-itM wu-iM = (u—iM)(u—é-— iM) 





we have, using (3.16), 


' anf tn HO it ithe ou 
P ae 2). 
L(x) s | [ dy fy) Lim. | du Tar | yy * [GE + iM) | 


bad én Ree™ iuy N+} Eeit=—Mz 
+|([ mae —c —N-4 " G(E + 1M)(u — § — 1M)|’ 





where the second term on the right is absolutely integrable. Since f(y) « 1” and 
since the transform of (3.15) converges in the q** mean by Theorem A, we have if 
g(u) is the transform of f(y), 


he) & fe eT om? ue eae a 


G(u) g(u) £ e-Matite | 
+| [0 aio ae Sin ee PS - 








Or, for M > 1, 


" dé *| Gu) glu) | | 
w-4 [GE + 7M) | Je] u— ro 





I(x) s anenisl [ 


But by Theorem A, g(u) « L*. Using Hélder’s inequality we get 


+4 dé 
w—4 | G(E + tM) | 





I(x) s apn ei! [ 


where B is some constant depending on f(x), p, and D. If we now use (3.7) of 
Lemma 3, we get 


(3.17) I(x) < Av BN (M? + N2)? Mr! #0, 


Again let us consider that part of the right member of (3.13) for which ¢ runs 
over the right vertical side of the rectangle C. We denote this by 


I(x) = | [ dy f(y) 1.i.m. : du G(u) e~* 


=A 





M e721 
[eu aystemececical 
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If we split the range of the integral in u into the ranges (— A, —2N), (—2N, 2N), 
and (2N, A), then, proceeding in a manner similar to that used with J,(x), we 
can invert the order of integration in y and u, where in this case we use 


1 —— N+3+% — 
u—-N—4—i u—rm (u—r»)(u—N — $ — tm)’ 











on the ranges (—A, —2N), (2N, A). If we define g(u) as before, we have 


I(x) < [ _, de Gu) glu) f. “GIN +44 lee N-—4— 1) 














Gu +N e-7 
| [avout + pM es es 
Let 
Hn = — N + Ann 
Then 
lun —n|S 
and 











Gu+tN+4) _(ut+} “Tt (ut+4— un) (utd — mn 
Sent - GEE *) IG HE) GHEE). 
Let 

aie) = - 0 {I (1- 2) (0-2). 


Then 


e* 








I(x) < | f. du g(u + N + 3) Gu(u + 3) i. "Gin + De — wm) | 


Clearly 


1 
u— tm 


4 


< unless |u| < 1 and <1, 
[u + — mol | u| In| 











and therefore 


= |gu+N +4) Gru +3)| [* Aitnc 











+ [ dulou+ N+ 9 Guta] [ar — | 
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If we now make use of Lemma 3, we have for |x| < 


h(t) < Be | on Ett Dave +9 




























Bi 
pee ey, - 
(3.18) ( | x % u+?2 7 . 
4 / d N+3)Gy | d me 
oat 9 u|g(u + N + 3) @+Hl)] " (xt) Gala PD | 
For | 0 | = 1, 4 
-—™ 1 dem 
Gy(in +3) Gw(4) | Slql ver? dn Guin + 4) |’ 











Using (3.8) of Lemma 3, we have 





e7= 1 


Gu(in + 4) Gn(4) 








< of a. Me CS ) 
s\n] (52+ Max | G;(in +9) |) 












nS G y(z) 


= oper pm a 


Gy(in + 3) = 









where C can be taken as a circle of radius 10 about the origin. Using (3.6) of 
Lemma 3 we see that there exists an Aj such that | Gy(in + 3):| < Aw, || <1. 
Therefore 








e771 _ 3 | 
Gn(4 +) Gw(%) 


Using this we have 


< An | 7|, in| $1. 


















1 
| » | d 
+ Ae [ttl a 





sri 
~ | Gw(3) | -17 + Wu 


+ 2 An < An. 





ooo 
-1 (u — im) Guin + 9) 





a 
Au 





Using this in (3.18) we have 


Aig “ 
hia) st | 


1 
+ An [lout N +9) [du 
~1 





g(u + N + 3) Gr(u + 3) 
u+%— Mo 





du 















We know from Theorem A that 






(3.19) [ ” | eles) (edu < @. 














+ 
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Using Hélder’s inequality, we get 
Gy(u + 3) |? 


ne) s tf sat-n au} | [| o(ed lea fe 
+ top| [i hetu t a edu fel [7] See DP au] 


1 
+24n] [out N+ 3 [edule 














u+% —"Ho 


If we now use (3.19) and Theorem III, we get 
(3.20) lim I2(x) = 0, |x| Snr —6, 56> 0, 


No 
uniformly in z. This holds entirely independently of M. 

Let us denote that part of the integral on the right side of (3.13) for which ¢ 
varies along the lower horizontal side of the rectangle C by I;(z). Then clearly 
I;(x) satisfies the inequality (3.17) of I:(z). We also introduce an J,(x) similarly 
related to I2(x). 

By (3.20), we can make J2(x), and therefore J2(x) + I(x), arbitrarily small 
independently of M by choosing N sufficiently large. Having done this we can, 
for the particular value of N, make J,(x) + J3(x) arbitrarily small by making 
M sufficiently large. This proves that 


lim |e etme f” gula) $) dy — 1 f” py) BAPE — iv =0 


No 





uniformly in x for |x| <S 7 —6,5>0. 

To show that the summability properties of the two series are the same 
requires a slight variation of the ideas used above. We will take the case of 
Riesz summability of order a. How to proceed with various other types of 
summability will be obvious from this. We suppose that \» ~ 0. 

Let C; represent the rectangle with vertices at -iM, N + 4 —iM,N+3+4+ 
7M, 7 M and let C2 be the reflection of C; in the imaginary axis. Then as in 
(3.12) 


wi etfs . = 
Flim. [ se Lf agence! - hy) a 
3.21 apes gia) *] 
ated * L@ie-o\ "Kay * 
én ° iis Inn] \X 1 f* . we — y) ” 
= 2) nlwde (1 - 1) -1/ 008 , es (1 — u)* du. 


If we now multiply through by f(y) and integrate as in (3.13), we have to show 
that the repeated integral analogous to the left side of (3.13) tends uniformly 
to zeroas N — o if |2| <a—6. That this is true is clear if we observe that 





th 


o “Yr 


= _— 
», me 
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the integral from ¢ = iM to § = —iM in (C; is cancelled by the integral from 
§ = —tM tof = 1M in C: in (3.21) and thus we are really only concerned with c 
as it varies over the sides of the rectangle C of (3.13). This we can handle in a 
manner similar to that used on (3.13). Thus 


lim 2 | erm(1 ~ ely [ ; S(ygn(y) dy 


N-o 
— ging | n | "eh. . —in 
e (1 N44) 2 | Sue vdy| = 0 


uniformly for | x | S m — dif ky» # 0. 

If Xo = 0 we can modify the contours C, and C; by taking a small semi-circular 
path of radius p around the origin. In this case we have to consider not only 
that part of (3.21) for which ¢ varies over C but also a part involving 


[i eee corte L(t - wig) - 0+) 
“4n G@M(u—H) N+3/ — N+} 


where ¢ = pe‘. 
The fact that 





to) -(ty5) 
ean pe ee 
( V+} t+ 


has a zero at ¢ = 0 is what allows us to discard this part of the left side of (3.21). 
Thus the Riesz summability of the two series are the same. The same method 
applies to Abel and many other types of summability. 
We have completed the proof of Theorem I, since we have shown the existence 
of a set {gn(x)} satisfying all the requirements of {h,(x)}. Therefore we now 
take for {h,(x)} in Theorem I the set {g,(zx)}. 


— ——— 
SR 
we 













Se png te 







LS eee 
a ya ae i, 
Aes 





—- 
~~ 
ee 





ne 














4. Proor or THEOREM II. Here we show that the set 





—1 
Wenn He. n> 0, 


(4.0) Ao == 0, 


we 
ham nt Fo, n>0, 


which obviously satisfies (1.4), does not have the properties which we have shown 
sets satisfying condition (1.3) to have. 

Let us suppose that {e’**} has a biorthogonal set {h,(x)} and let us consider 
ho(x). Clearly ho(x) must be integrable. Therefore there exists 


(4.1) H(w) = [ ho(x) e*? dx 


T 
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and moreover 
H(,) = 0, n ~ 0, 
and H(0) = 1. Therefore 





Ms 52 
(4.2) H(w) = ow) |] (1 - -~—— 
1 (n <= a) 
2p 
where #(w) is an entire function. From (4.1), 
(4.3) | H(u + iv) | S etl [ | ho(x) | dx < Cell 


for some constant C. ) 
Using the gamma function we have 


i] 


gw) = |] ee r= 1) 


1 (n - 2 


+1 p—1\. p—1l 
Be )r (w + Op ) sin x(w + 5) 
3 


1 
"(e+ ?ap) 





(4.4) 





Using (3.10) we have for |v| > 1 

e™ || 
lu + air 4 1° 
Using this and (4.3) in H(w) = $(w)g(w), we have 
(4.6) CAz2(|w |"? + 1) > | e(w) |, 


Using the Hadamard product theorem to write H(w) as an infinite product, 
it is clear that ¢(w) is at most of order 1. Thus by a well-known result of 
Pragmén and Lindeldf, (4.6) implies that ¢(w) = O(| w |"). Since p > 1 it 
follows that ¢(w) is a constant. Thus (4.1) becomes 


[ ho(x)e™* dx = g(u). 


(4.5) |g(u + tv) | > Are 





lv| > 1. 


By (8.3) of Theorem A 


A 


h(x) = lim — 


lim 5: a g(u)e—“ du 


(wherever this limit is defined). Since g(u) is even, we have 


A 
ho(x) = lim : g(u) cos ux du. 
0 


Ao 





ON NON-HARMONIC FOURIER SERIES 


Or 


T? (22 ) r r(u a Maes ') sin r(u aa tea. ') 
hex) = —\— > 2p 


f= jim COs UX du 
a 


Apo J0 p+il 
(w+ ) 


We now use a more refined form of Stirling’s formula which for real u gives 















ru +1) = Qn urtte[ + o( )| 


. 
u 





asu— oo, Using this we have 








A 


ho(x) = Ci lim cos ux sin 3 (. + 2) u-"/? du + O(1) 


Aw J0 


where 
C, = eer? (2+!) 7. 
2p 


But 


2 cos ux sin + (u + e=*) = sin EC + 2-1) + us| 
2p 2p 


Thus for x = 0 


ho(x) = 34C, lim : sin Ec + p=) -- uz | up du + O(1). 


AO 





Therefore for r > x = 0 
ha(a) = 4C, sin = (1 ‘ ‘) ( — 2) + O(1). 


Since ho(x) is even, we have 


ho(x) = C2(r — |x|)” + O(1), |a| <x. 
If 






1 
(x — |x|)" log (w — | x1) 





f(z) = 
for 3  < |x| < wand zero elsewhere, then 


& | f(x) |Pdx < @. 


Tv 





+ sin | e( 2=) ~_ ue |, 
2p 


a ee, 


Tapa 
= 
seer 


— 


ihe 
= 


= 
Sa 
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But clearly 


it is obvious that 


NORMAN LEVINSON 


|" ho(x) f(a) dx 


g(u) ~ Csu-/? sin + ( 


= 0, 


p—l 
u+ Dp 


[oc |P?du = © 


PRINCETON UNIVERSITY AND THE INSTITUTE FOR 
ADVANCED Sroupy. 


Therefore the conclusions of Theorem I cannot hold for the set {,»} given by 
(4.0). In other words, condition (1.3) cannot be relaxed to (1.4) without 
disrupting the entire theorem. 

Incidentally since for u — 


) 


which shows that Theorem III is a best possible theorem. 
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